Principal Software Engineer, Foundry Agents - Coreai

Microsoft Microsoft · Big Tech · Redmond, WA +2 · Software Engineering

Principal Software Engineer role focused on building foundational platforms for intelligent agents and generative AI systems. Responsibilities include developing large-scale, cloud-native systems for agent deployment, execution, tool integration, fine-tuning, training, observability, evaluation, and optimization in production. The role operates at the intersection of distributed systems, AI infrastructure, and developer platforms, requiring strong systems thinking and architectural decision-making for systems demanding high performance, reliability, security, and compliance.

What you'd actually do

  1. Build and scale foundational services that power secure agent deployment and execution, governed tool integration, training/fine‑tuning, and observability/evaluation
  2. Design and evolve distributed runtimes and cloud services to run agents securely at enterprise scale with strong reliability, security, and compliance guarantees
  3. Improve iteration loops for researchers, engineers, and developers through better tooling, abstractions, and automation across fine‑tuning, evaluation, and optimization
  4. Debug and optimize complex interactions across models, data, hardware, and infrastructure
  5. Drive technical direction across the full software development lifecycle, influencing design tradeoffs and long‑term architecture

Skills

Required

  • Computer Science or related technical field
  • 6+ years technical engineering experience
  • coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Python
  • Microsoft Cloud Background Check

Nice to have

  • Master's Degree in Computer Science or related technical field AND 12+ years technical engineering experience
  • 15+ years technical engineering experience
  • coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • equivalent experience

What the JD emphasized

  • secure agent deployment
  • governed tool integration
  • enterprise scale
  • fine-tuning
  • training
  • observability
  • evaluation
  • optimization
  • compliance

Other signals

  • agent lifecycle
  • enterprise scale
  • developer platforms
  • distributed systems
  • AI infrastructure