Senior Site Reliability Engineer I

Axon Axon · Enterprise · Atlanta, GA · 1505 SAAS Ops

Senior Site Reliability Engineer responsible for designing, building, and maintaining cloud infrastructure and automation platforms across multi-cloud environments (Azure, AWS) with a focus on FedRAMP compliance and large-scale Kubernetes. The role involves significant coding in Go/Python, incident response, and influencing architectural patterns, with an emphasis on leveraging AI tooling for engineering delivery and operational automation.

What you'd actually do

  1. Design and implement services, APIs, and automation tooling that improve cloud infrastructure reliability, scalability, and operational efficiency.
  2. Write production-quality code in Go, Python, or similar. Lead design reviews and set code quality standards within the team.
  3. Build robust, easy-to-use foundational platforms and tools that enable engineering teams to provision services rapidly, consistently, securely, and cost-effectively.
  4. Architect cloud infrastructure across Azure and AWS, including Kubernetes platforms, networking, and identity boundaries.
  5. Participate in on-call rotations and incident response. Use operational experience to identify systemic reliability risks and drive platform improvements.

Skills

Required

  • 8+ years of applicable experience in software engineering, cloud infrastructure, or site reliability engineering.
  • Experience designing and operating cloud platforms at scale across Azure, AWS, or similar.
  • Proficiency in one or more managed languages (Go, Python, Java, C#) in the context of building services, not just scripting.
  • Experience architecting and operating Kubernetes platforms (AKS, EKS, or similar) at production scale.
  • Experience with Infrastructure as Code (Terraform, CloudFormation, CDK, or similar) for managing multi-environment, multi-region infrastructure.
  • Experience utilizing CI/CD platforms to automate provisioning infrastructure, software builds, tests, and releases.
  • Experience using observability tools such as APM, logging, and metrics to assist with debugging issues.
  • Track record of leading design reviews, writing technical proposals, and setting engineering standards within a team.
  • Builder-operator mindset with proven production ownership (uptime, SLOs, on-call, incident leadership).
  • Experience mentoring engineers and conducting technical interviews.

Nice to have

  • Experience using AI-assisted development tools to accelerate engineering workflows is a strong plus.

What the JD emphasized

  • handling of classified federal data
  • open to U.S. citizens only