Research Intern - Applied Sciences Group

Microsoft · Big Tech · Redmond, WA +1 · Applied Sciences

Research intern to investigate Small Language Model (SLM) architectures and techniques, such as recurrent transformers and universal transformers, for maximizing LLM throughput under limited high-speed cache on hardware targets such as SoCs, GPUs, and NPUs. The internship involves model training at scale on Azure compute and collaboration with a multidisciplinary team.

What you'd actually do

  1. Work with a small team to investigate recent Small Language Model (SLM) architectures and techniques, such as recurrent transformers and universal transformers, as potential approaches for maximizing the throughput of Large Language Models (LLMs) with limited high-speed cache (see the sketch after this list).
  2. Learn how to apply your model training skills at scale using Azure compute.
  3. Be mentored by a multidisciplinary team with expertise in both on-device implementation and literature/state-of-the-art (SotA) approaches.
  4. Collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community.
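
For context on item 1: the core idea behind universal/recurrent transformers is that one set of layer weights is reused at every depth step, so the weight footprint that must stay resident in high-speed cache shrinks roughly in proportion to the number of distinct layers removed. The PyTorch sketch below is illustrative only, not from the posting; the class name, dimensions, and step count are assumptions.

    # Illustrative sketch: a "universal" encoder that reuses one layer's
    # weights across depth, vs. a conventional stack of distinct layers.
    import torch
    import torch.nn as nn

    class UniversalEncoder(nn.Module):  # hypothetical name, not from the JD
        def __init__(self, d_model=256, nhead=4, num_steps=6):
            super().__init__()
            # A single transformer layer, applied num_steps times (weights shared).
            self.layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
            self.num_steps = num_steps

        def forward(self, x):
            for _ in range(self.num_steps):
                x = self.layer(x)  # same weights at every depth step
            return x

    def n_params(m):
        return sum(p.numel() for p in m.parameters())

    shared = UniversalEncoder()
    stacked = nn.TransformerEncoder(  # baseline: six distinct layers
        nn.TransformerEncoderLayer(256, 4, batch_first=True), num_layers=6)
    # The shared model holds roughly 1/6 of the stacked model's weights,
    # i.e. far less to keep resident in cache at the same nominal depth.
    print(f"shared: {n_params(shared):,} params vs stacked: {n_params(stacked):,}")
    y = shared(torch.randn(1, 8, 256))  # (batch, seq, d_model)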

Skills

Required

  • Advanced Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field
  • At least 1 year of experience in investigating and modifying transformer-based AI models

Nice to have

  • Project portfolio, open-source code, or other verifiable evidence of pursuing state-of-the-art AI systems
  • Experience with different hardware platforms (SoC, GPU, NPU)

What the JD emphasized

  • transformer-based AI models

Other signals

  • investigate recent Small Language Model (SLM) architectures and techniques
  • maximizing the throughput of Large Language Models (LLMs) with limited high-speed cache
  • apply your model training skills at scale using Azure compute
  • mentored by a multidisciplinary team with expertise in both on-device implementation and literature/state-of-the-art (SotA) approaches