Principal Software Engineer - Observability Common Infrastructure

Microsoft · Big Tech · Redmond, WA +1 · Software Engineering

The Principal Software Engineer will design, develop, and operate cloud-native platform components on Kubernetes for Observability services within Azure Data. The role focuses on building scalable, reliable, and secure infrastructure for AI-enabled capabilities, with a strong emphasis on Kubernetes expertise and distributed systems.

What you'd actually do

  1. Lead the design, development, and operation of cloud-native platform components on Kubernetes, with a focus on reliability, networking, security, and observability at scale.
  2. Drive end-to-end architecture for large-scale, distributed systems supporting globally deployed services.
  3. Design and deliver highly available, scalable systems, ensuring strong performance, resilience, diagnosability, and cost efficiency.
  4. Provide technical leadership in Kubernetes-based infrastructure, including service-to-service communication, traffic management, and resiliency patterns.
  5. Partner across engineering, platform, and infrastructure teams to define and execute on cross-organizational technical strategy and long-term investments.

Skills

Required

  • Kubernetes
  • cloud-native architectures
  • distributed systems
  • C
  • C++
  • C#
  • Java
  • JavaScript
  • Python

Nice to have

  • Go
  • service-to-service communication
  • traffic management
  • resiliency patterns
  • observability at scale
  • AI-enabled capabilities

What the JD emphasized

  • deep Kubernetes expertise
  • Kubernetes internals
  • cloud-native architectures
  • Advanced Kubernetes networking, traffic management, and reliability patterns
  • distributed systems design
  • owning live production systems
  • on-call rotations
  • incident mitigation
  • operational excellence