Senior Software Engineer

Microsoft Microsoft · Big Tech · Hyderabad, TS, IN · Software Engineering

Senior Software Engineer to build and operate AI Agents as Service for cloud operations, focusing on agent capabilities, orchestration, evaluation, safety, and reliability in production environments.

What you'd actually do

  1. Take ownership of important areas of the Azure SRE Agent Platform, including agent capabilities, orchestration, evaluation, user experiences on different form factors and supporting platform services
  2. Build and iterate on agentic systems, including tools, planning and execution loops, evaluations, and safety mechanisms
  3. Design and ship reliable capabilities that improve incident detection, diagnosis, mitigation, and operational learning
  4. Use telemetry, experiments, evaluations, and user feedback to guide iteration and investment
  5. Contribute to resilient, observable systems that operate safely and effectively in production

Skills

Required

  • 7+ years of experience building production software
  • Strong understanding of Generative AI & software engineering fundamentals, data structures, and problem-solving
  • Ability to learn new technologies quickly and adapt to deliver customer and business impact

Nice to have

  • Hands-on experience of building and operating LLM powered agentic systems in production, with direct ownership over quality, reliability, and iterations
  • 4+ years of experience building and operating cloud platforms or distributed services, with depth in service architecture, deployment, and observability
  • Strong product mindset with a track record of owning ambiguous problem spaces and driving them to high-quality outcomes
  • Solid engineering fundamentals, including systems design, performance, and debugging in complex production environments
  • Track record of designing, running, and optimizing evaluations for agentic systems, including tools, prompts, and agent loops
  • Expertise with Kubernetes, container orchestration, or cloud-native infrastructure is a strong plus
  • Experience contributing to or leading open-source projects at scale is a plus

What the JD emphasized

  • production software
  • Generative AI
  • agentic systems in production
  • evaluations for agentic systems

Other signals

  • AI Agents as Service
  • virtual SRE teammates
  • agentic systems
  • production
  • customer impact