Principal Software Engineer - Coreai

Microsoft Microsoft · Big Tech · Redmond, WA +1 · Software Engineering

Principal Software Engineer to build and scale the core serving systems, request routing, and distribution for all LLMs across Microsoft and Azure customers. The role focuses on delivering inference capabilities reliably, efficiently, and with ultra-low latency for a wide range of AI-powered product experiences.

What you'd actually do

  1. Build and scale the core serving systems and smart request routing and distribution for all LLMs (OpenAI, Mistral, Grok, DeepSeek and many others).
  2. Design, implement and deliver AI services to support product offerings for large-scale LLM serving.
  3. Innovate technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products.
  4. Ship new product features and improvements at a high velocity.
  5. Collaborate closely with product management and partner teams to align technical direction with business goals.

Skills

Required

  • C, C++, C#, or Java
  • coding
  • technical engineering experience

Nice to have

  • design and problem-solving
  • system performance
  • scalability
  • engineering best practices
  • distributed systems
  • request serving at scale
  • high-performance storage
  • distributed databases
  • networking across global-scale infrastructures
  • shipping with high velocity
  • iterative approaches
  • building high-quality, reliable systems at scale
  • leading complex technical initiatives
  • customer-obsessed approach
  • empathy
  • drive to deliver impactful solutions

What the JD emphasized

  • core serving systems
  • smart request routing and distribution
  • LLM inferencing workloads
  • ultra-low latency
  • large-scale LLM serving
  • high velocity

Other signals

  • serving LLMs at scale
  • low latency inference
  • foundational to Microsoft's AI strategy
  • powers all LLM inferencing workloads