Principal Software Engineer at Microsoft

What you'd actually do

Collaborate broadly with ML researchers, system engineers, and production engineers.

Engage with key partners to understand and evaluate performance and quality for state-of-the-art LLMs at different scales.

Build software tools to support validation and exploration of LLM optimization technologies.

Perform software development in model scripting and/or kernel languages, such as Python, C/C++, CUDA.

Identify requirements, scope solutions, estimate work, schedule deliverables.

Skills

Required

Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience
coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
Ability to meet Microsoft, customer and/or government security screening requirements
Microsoft Cloud Background Check

Nice to have

Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience
Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience
Experience in training or serving Deep Neural Network models
Experience with Language Models and ML system optimization

Other signals

Develops AI software for training and deploying advanced AI models

Builds software stacks for supercomputers and AI accelerators

Optimizes and scales model training and inference

Works with OpenAI on Azure OpenAI service models

Enables large scale inferencing and training of advanced AI models on novel AI hardware

Overview

The Artificial Intelligence (AI) Frameworks team at Microsoft develops the AI software used to train and deploy the world’s most advanced AI models. We collaborate with our hardware teams and partners to build the software stacks for Microsoft’s next-generation supercomputers and the Maia AI accelerators. We work closely with ML researchers and developers to optimize and scale out model training and inference. We work with OpenAI on the models hosted on the Azure OpenAI service.

The team operates at the intersection of AI algorithmic innovation, purpose-built AI hardware, systems, and software. We are a cross-discipline team of highly capable and motivated people with a collaborative and inclusive culture and with a shared mission of supporting and driving our AI future.

As a member of this team, you will have the opportunity to work on developing and evaluating core algorithmic and hardware technologies to enable large scale inferencing and training of the most advanced AI models on novel AI hardware.

This is a technical role: it requires hands-on software design and development skills. We’re looking for someone who has a demonstrated history of solving hard technical problems and is motivated to learn new areas and tackle the hardest problems in building efficient AI systems.

Responsibilities

Collaborate broadly with ML researchers, system engineers, and production engineers.
Engage with key partners to understand and evaluate performance and quality for state-of-the-art LLMs at different scales.
Build software tools to support validation and exploration of LLM optimization technologies.
Perform software development in model scripting and/or kernel languages, such as Python, C/C++, CUDA.
Identify requirements, scope solutions, estimate work, schedule deliverables.

Qualifications

Required Qualifications:

Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.

**Other Requirements: **

Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:

Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
Experience in training or serving Deep Neural Network models.
Experience with Language Models and ML system optimization.

#AIInfra

Software Engineering IC5 - The typical base pay range for this role across the U.S. is USD $139,900 - $274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about **requesting accommodations.**