Research Intern - Model Optimization and Hw Acceleration

Microsoft Microsoft · Big Tech · Redmond, WA +1 · Applied Sciences

Research Intern role focusing on optimizing LLMs, VLMs, and generative models, with an emphasis on hardware acceleration. The role involves designing experiments, developing algorithms, and collaborating on practical solutions, aiming to advance AI technologies.

What you'd actually do

  1. Conduct research in one or more of the following areas: LLMs, VLMs, and Generative Image & Video.
  2. Design and implement experiments to test new hypotheses and validate research findings.
  3. Develop and optimise algorithms and models for various AI applications.
  4. Collaborate with team members to integrate research outcomes into practical solutions.
  5. Present research findings to the team and contribute to publications and patents.

Skills

Required

  • Enrolled in a full time bachelor's, master's, MBA, or PhD program in area in computer science, computer engineering, electrical engineering, or a related field during the academic term immediately before their internship.
  • At least one additional quarter/semester of school remaining following the completion of the internship.

Nice to have

  • First-author publication(s) at top-tier venues or demonstrably comparable research impact
  • Experience with emerging memory technologies such as CXL based memory
  • Clear communication, self-motivated, curiosity, and a bias for hands-on experimentation
  • Interpersonal skills, cross-group, and cross-culture collaboration.
  • Familiarity with computer systems' architecture, cache and memory hierarchy

What the JD emphasized

  • First-author publication(s) at top-tier venues or demonstrably comparable research impact

Other signals

  • LLMs
  • VLMs
  • Generative Image & Video
  • Model Optimization
  • HW Acceleration