Research Intern - Multimodal Language Models

Microsoft · Big Tech · Redmond, WA +1 · Applied Sciences

Research intern role exploring efficient multimodal language models, using techniques such as model compression, quantization, and optimization for resource-constrained platforms. The work focuses on training strategies for vision-language tasks, prototype implementation, experiment design, and analysis of results.

What you'd actually do

  1. Research Interns put inquiry and theory into practice.
  2. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life.
  3. Research Interns not only advance their own careers but also contribute to exciting research and development strides.
  4. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community.
  5. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.

Skills

Required

  • Accepted or currently enrolled in a PhD program in Computer Science or related STEM field.

Nice to have

  • Foundation in machine learning and deep learning
  • Multimodal language models
  • Transformer architecture
  • Efficient model design
  • Compression
  • Quantization
  • Proficiency in modern deep learning frameworks (e.g., PyTorch, DeepSpeed)
  • Scalable model development and optimization
  • Proven ability to define and execute original research agendas
  • Creativity and technical rigor
  • Motivation to publish in top-tier academic venues

What the JD emphasized

  • PhD program in Computer Science or related STEM field
  • Foundation in machine learning and deep learning
  • multimodal language models
  • transformer architecture
  • efficient model design
  • compression
  • quantization
  • modern deep learning frameworks (e.g., PyTorch, DeepSpeed)
  • scalable model development and optimization
  • define and execute original research agendas
  • creativity and technical rigor
  • publish in top-tier academic venues

Other signals

  • efficient multimodal language models
  • model compression
  • quantization
  • model optimization
  • vision-language tasks