Microsoft has 521 active AI-related job listings. The majority of these roles are focused on agents, representing 37% of the total, followed by application and serving infrastructure. Engineering is the most frequent function, with a significant number of openings, and the United States is the primary hiring country. Frequent tech tags include agent orchestration, model serving, and LLM observability, suggesting a focus on operationalizing AI models. Over the last 30 days, Microsoft has added 280 new AI roles, a 157% increase compared to the previous 30-day period.
Currently tracking 250 active AI roles, down 24% versus the prior 4 weeks. Primary focus: Agent · Engineering. Salary range $65k–$331k (avg $195k).
Microsoft currently has 343 active AI-related roles in our index. The most common open titles are: Principal Software Engineer (19), Senior Software Engineer (19), Software Engineer II (8), Principal Applied Scientist (7), Principal Data Scientist (4). Most positions are in Engineering and Research.
Microsoft's active AI hiring is concentrated in: agents (36%), application (21%), serving infrastructure (19%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
Microsoft is hiring AI talent in: United States (308 roles), Canada (15 roles), Japan (8 roles), United Kingdom (7 roles).
Job postings at Microsoft most frequently mention: Computer Architecture, Python, Machine Learning, C#, C++.
In the past 30 days, Microsoft has posted 227 new AI-related roles.
| Title | Stage | AI score |
|---|---|---|
| Member of Technical Staff, Evaluations Engineering - MAI Superintelligence Team This role focuses on building and scaling the evaluation infrastructure for generative AI models on large-scale GPU clusters. It involves developing sophisticated tools and techniques for reliability, performance, and health monitoring, and collaborating with model scientists on evaluation methods and inference strategies. The role also touches on pretraining software development and benchmarking. | Eval GateServe | 9 |
| Technical Solution Management Specialist - AI Evaluation, Research, and Technical Architecture Func. This role focuses on establishing and operating an AI Innovation Lab within Microsoft's Customer Experience and Success organization. The primary responsibility is to research, validate, and evaluate early-stage AI opportunities (pre 0-to-1) before they are handed off for prototyping or product engagement. This involves defining research frameworks, building evaluation approaches for various AI workloads, developing automated testing, and creating reference architectures. The role acts as a bridge between business strategy, research, and technical execution, ensuring AI innovations are rigorously tested and aligned with Responsible AI standards. |
| Eval GateData |
| 8 |
| Senior Data Scientist Senior Data Scientist role focused on developing and implementing methodologies to evaluate LLM performance for Copilot, including training classifiers, experimenting with data collection, and providing real-time performance signals. The role involves creating automated evaluation frameworks and working closely with user researchers and product leaders. | Eval Gate | 8 |
| Senior Software Engineer - Responsible AI (CoreAI) Senior Software Engineer focused on building Responsible AI services, including identifying, measuring, mitigating, and monitoring AI risks across various content types. The role involves designing and developing large-scale distributed cloud services with a focus on safety, governance, inference, evaluation, and multimodal safety infrastructure. | Eval GateAgent | 8 |
| Senior Researcher - AI & Society - Microsoft Research Senior Researcher at Microsoft Research focusing on the intersection of AI systems and society, with an emphasis on sociotechnical approaches to AI evaluation, responsible AI in industry, and AI safety. The role involves interdisciplinary research, collaboration with industry teams, and a strong publication record. | Eval Gate | 8 |
| Research Intern - AI Evaluation and Alignment Research Intern role focused on advancing the quality, reliability, and evaluation of LLM-based systems by exploring new ML methods for AI assessment and alignment. Responsibilities include co-developing research projects, implementing ML approaches (training/fine-tuning), and developing evaluation frameworks. Requires PhD enrollment in a technical field and hands-on LLM experience. | Eval GatePost-train | 8 |
| Research Intern - STAC, NYC (Sociotechnical Alignment Center) Research Intern position at Microsoft's Sociotechnical Alignment Center (STAC) focusing on evaluating AI systems, particularly generative ones. The role involves applying measurement theory from social sciences and statistics to assess risks, capabilities, and performance. Collaboration with Fairness, Accountability, Transparency, and Ethics in AI (FATE) researchers is expected. The internship emphasizes theoretical and methodological approaches to advance AI system evaluation. | Eval Gate | 8 |
| Principal Applied Scientist, Experimentation Platform - CoreAI This role focuses on building and scaling an experimentation platform for AI products, enabling teams to evaluate, refine, and safely deploy new AI innovations. It involves pushing the envelope on online experimentation methodology and agent evaluations, collaborating with various engineering and science teams, and translating applied research into production-quality features. | Eval GateAgent | 7 |
| Senior Security Researcher Senior Security Researcher role focused on threat hunting within Microsoft Defender Experts. The role involves exploring large datasets to detect advanced attack techniques, generating custom alerts, collaborating with data science and threat research teams, and building hunting tools and automations. Requires a strong background in cybersecurity, data analysis, and potentially machine learning, with a focus on enterprise security and threat intelligence. | Eval Gate | 7 |
| Senior Software Engineer Senior Software Engineer role focused on building AI-powered operational excellence for Azure Reliability. The role involves developing evaluation loops, generalizing ML solutions into frameworks, operationalizing prompted classifiers at scale, and ensuring responsible AI practices. | Eval GateAgent | 7 |
| Research Intern - Inference Economics and Human Agency Research intern to conduct empirical research on how human oversight shapes the economic return of AI-assisted work. This includes designing controlled experiments, developing session-level evaluation frameworks that link inference cost to output quality and human effort, and analyzing how interface design choices affect user confidence, reliance, and decision quality during AI-assisted tasks. The role involves collaboration with the MADE team and preparing a submission-ready research manuscript. | Eval GateAgent | 7 |
| Member of Technical Staff, Principal Engineering Manager Seeking an experienced engineering leader to build, scale, and run a high-performing engineering organization responsible for Copilot AI Evaluation. This role involves setting technical and organizational strategy for LLM evaluation, partnering with senior leadership, and owning the delivery of evaluation platforms and novel techniques to measure and improve Copilot quality at scale. | Eval GateAgent | 7 |
| Member of Technical Staff - Copilot AI Evaluation Engineering Manager Lead a team of engineers to build and manage LLM evaluation solutions for Microsoft Copilot, focusing on quality, reliability, and scalability. This role involves designing evaluation platforms and techniques to measure and improve the performance of AI companions. | Eval Gate | 7 |
| Senior Software Engineer - CoreAI Senior Software Engineer to join the Evaluation platform team within Core AI, focusing on building core services for large-scale agent observability and optimizing AI agent performance. | Eval GateAgent | 7 |
| Principal Applied Scientist, Experimentation Platform - CoreAI The Principal Applied Scientist will work on Microsoft's Experimentation Platform (ExP) within CoreAI, focusing on enabling high-scale online experimentation for AI-driven applications. This role involves advancing experimentation methodology and agent evaluations, collaborating with various engineering and science teams, and translating applied research into production features for a large-scale platform. The goal is to accelerate product learning and drive progress across Microsoft's AI ecosystem by providing robust experimentation capabilities. | Eval GateAgent | 7 |
| Principal Researcher - AI & Society - Microsoft Research Principal Researcher at Microsoft Research focusing on the intersection of AI and society, with an emphasis on sociotechnical approaches to AI evaluation, measurement, and responsible AI in industry. The role involves interdisciplinary research, collaboration with engineering and policy teams, and a strong publication record. | Eval Gate | 7 |
| PostDoc Researcher-FATE (Fairness, Accountability, Transparency, and Ethics in AI-Microsoft Research Postdoctoral Researcher position at Microsoft Research NYC focusing on Fairness, Accountability, Transparency, and Ethics in AI (FATE). The role involves pursuing an independent research agenda, collaborating with researchers, and contributing to ongoing projects related to the social implications of machine learning and AI. Research areas include AI evaluation, responsible AI in industry, AI law and policy, transparency, human-AI interaction, and various social impacts of AI. | Eval Gate | 7 |
| Software Engineer II Full Stack Software Engineer to build capabilities for Microsoft Copilot, working across evaluation platform stages: data sampling, AI data collection/processing, data analysis/evaluation, and insight generation. Responsibilities include experimentation, pipeline design, data analysis, and dashboard creation, with collaboration across M365 teams to assess Copilot's performance. Familiarity with LLMs, prompt engineering, and cloud infrastructure is preferred. | Eval GateData | 5 |
| Principal Software Engineer This role focuses on building and enhancing the evaluation platform for M365's AI offerings, enabling builders to run faster and more comprehensive evaluations throughout the development lifecycle. The goal is to automate tasks and improve performance understanding of AI features. | Eval Gate | 5 |
| Senior Software Engineer The role is for a Senior Software Engineer on the Evaluation Platform Team at Microsoft, focusing on building and improving systems that measure and evaluate AI quality for M365 AI products. The goal is to create reliable, scalable, and user-friendly tools to support various stages of AI evaluation, from fine-tuning to launching new features and onboarding partners. | Eval Gate | 5 |
| Member of Technical Staff - Full Stack Software Engineer Full Stack Software Engineer to build capabilities for Microsoft's personalized AI assistant, Copilot. The role involves working across the evaluation platform, including data sampling, collection, processing, analysis, and insight generation. Responsibilities include full-stack development, prompt engineering, leveraging AI tools, and collaborating with teams to assess Copilot's performance, trustworthiness, and visual appeal across various platforms and scenarios like multi-turn conversations with voice input. | Eval Gate | 5 |
| Software Engineer II - CoreAI This role focuses on building core services for an AI evaluation platform, specifically for agent observability within Microsoft's CoreAI group. The engineer will design, implement, and deliver AI services to support product offerings for large-scale agent observability, collaborating with product management and partner teams, and taking end-to-end responsibility for development lifecycle and production readiness. | Eval GateAgent | 5 |
| Principal Software Engineer The Principal Software Engineer will join the M365 Evaluation Platform Team to enhance the evaluation system for AI offerings, supporting millions of users. The role involves building capabilities to enable agile and faster evaluations, providing continuous tools throughout the development lifecycle, and automating tasks via tools or agents to improve performance understanding. The focus is on building reliable, scalable infrastructure and driving quality in products using data, with a platform engineering mindset. | Eval Gate | 5 |