Principal Research Engineer - Agent 365

Microsoft Microsoft · Big Tech · Redmond, WA +1 · Software Engineering

Principal Research Engineer role focused on architecting and delivering scalable AI systems for Microsoft 365 Copilot experiences, including foundational models, multi-agent systems, and RAG. The role involves technical leadership, driving innovation, production integration, evaluation, and ensuring responsible AI practices. It requires experience with LLMs, multimodal models, multi-agent architectures, and RAG, with a focus on end-to-end ML pipelines and deployment at scale.

What you'd actually do

  1. Architect and deliver AI systems across model development, data, infra, evaluation, and deployment spanning multiple product lines.
  2. Set technical direction for large programs; drive alignment across Research, Engineering, and Product.
  3. Integrate LLMs, multimodal models, multi-agent architectures, and RAG into Microsoft’s ecosystem.
  4. Establish standards for MLOps, governance, and Responsible AI, compliant with Microsoft principles and industry standards.
  5. Drive original research and thought leadership (whitepapers, internal notes, patents); convert insights into shipped capabilities.

Skills

Required

  • Bachelor's Degree in Computer Science or related technical field
  • 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • Ability to meet Microsoft, customer and/or government security screening requirements

Nice to have

  • Bachelor’s Degree in CS/EE/Math or related field
  • 10+ years in applied AI/ML research and product engineering
  • PhD in AI/ML or related field with top-venue publications and/or patents
  • Experience architecting and deploying LLMs/multimodal models and multi-agent systems in production at scale
  • Familiarity with Responsible AI frameworks and bias-mitigation techniques
  • Experience shaping product strategy and driving organizational change
  • Experience with Microsoft’s LLMOps s

What the JD emphasized

  • architecting and deploying LLMs/multimodal models and multi-agent systems in production at scale

Other signals

  • architecting and deploying LLMs/multimodal models and multi-agent systems in production at scale
  • define and execute technical strategy for foundational models, multi-agent systems, and next-generation Copilot experiences
  • ML Design & Architecture: Own end-to-end pipeline from data prep, training, evaluation, deployment, and feedback loops