Senior and Principal Applied Scientists - Coreai

Microsoft Microsoft · Big Tech · Redmond, WA +2 · Applied Sciences

This role focuses on the applied science foundation for observability in AI agents and multi-agent systems running at scale. It involves developing and applying scientific methods, evaluation frameworks, and measurement systems to understand, benchmark, diagnose, and improve agent-based systems in production. The role addresses unique observability challenges of AI agents, such as non-deterministic execution and emergent behaviors.

What you'd actually do

  1. Develop evaluation and measurement frameworks for single-agent and multi-agent systems, spanning quality, safety, reliability, cost, and behavioral consistency.
  2. Design methodologies that connect offline evals, online signals, and production telemetry to explain how prompt, tool, model, or orchestration changes affect real-world agent performance.
  3. Define scientifically grounded quality signals and benchmarks for agent systems, including task success, tool-use effectiveness, plan quality, failure modes, coordination quality, and user-perceived outcomes.
  4. Build models and analysis techniques that help detect regressions, identify root causes, and characterize agent behavior across diverse workflows and environments.
  5. Advance observability for AI systems through new approaches to trace analysis, agent health modeling, behavioral clustering, anomaly detection, and multi-agent coordination analysis.

Skills

Required

  • Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience
  • Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience
  • Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience
  • equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements

Nice to have

  • Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 9+ years related experience
  • Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 6+ years related experience
  • Doctorate in Statistic

What the JD emphasized

  • AI agents
  • multi-agent systems
  • observability
  • evaluation frameworks
  • production telemetry
  • offline evals
  • online signals
  • agent behavior
  • agent systems
  • agent observability
  • agent execution
  • agent systems

Other signals

  • AI agents
  • multi-agent systems
  • observability
  • evaluation frameworks
  • production telemetry