Member of Technical Staff, Principal Engineering Manager

Microsoft Microsoft · Big Tech · Redmond, WA +1 · Software Engineering

Seeking an experienced engineering leader to build, scale, and run a high-performing engineering organization responsible for Copilot AI Evaluation. This role involves setting technical and organizational strategy for LLM evaluation, partnering with senior leadership, and owning the delivery of evaluation platforms and novel techniques to measure and improve Copilot quality at scale.

What you'd actually do

  1. Build and lead a multi-team engineering organization (30+ engineers across multiple teams), including hiring and developing engineering managers who lead their own teams.
  2. Set the technical and organizational strategy for Copilot AI Evaluation and response quality, aligning with MAI's broader product and engineering vision.
  3. Partner with senior Eng and Product leadership (Partner+ level) to define priorities, influence roadmaps, and drive cross-organizational initiatives.
  4. Own end-to-end delivery of evaluation platforms, novel evaluation techniques, and agentic solutions for measuring and improving Copilot quality at scale.
  5. Recruit, develop, and retain world-class engineering talent — building a culture of technical excellence, accountability, and continuous learning.

Skills

Required

  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, Javascript, or Python OR equivalent experience

Nice to have

  • Master's Degree or PhD in Computer Science or related technical field AND 15+ years of engineering experience, including 8+ years of people management experience.
  • Demonstrated track record of building and scaling engineering organizations (hiring teams from scratch, structuring orgs, growing managers).
  • Experience delivering large-scale software systems in AI, machine learning, or related fields.
  • Experience managing organizations of 30+ engineers across multiple teams and workstreams.
  • Deep expertise in LLM evaluation, AI quality measurement, or ML infrastructure at scale.
  • Track record of partnering with senior leadership (VP/CVP level) to set strategy and drive cross-organizational programs.
  • Experience recruiting and developing senior engineering talent (principal engineers, engineering managers) in a competitive market.
  • Proven ability to operate effectively in fast-paced, ambiguous environments — comfortable making decisions with incomplete information and course-correcting quickly.
  • Strong technical judgment: ability to evaluate architectural tradeoffs, assess technical risk, and guide teams toward sound engineering decisions without needing to write the code yourself.
  • Experience leading distributed or multi-site engineering teams.

What the JD emphasized

  • managing managers
  • LLM evaluation
  • AI quality measurement
  • large-scale software systems in AI
  • managing organizations of 30+ engineers

Other signals

  • AI evaluation platforms
  • measuring and improving Copilot quality at scale
  • LLM evaluation
  • AI quality measurement