Senior Manager, Site Reliability Engineering (federal)

Okta Okta · Enterprise · Washington, DC · Tech Ops-610

Okta is seeking a Senior Manager, Site Reliability Engineering (Federal) to lead teams responsible for infrastructure platform and shared services, including Edge networking, K8s platform, CI/CD, Observability, and automation tooling. The role focuses on scaling SaaS services, driving DevOps maturity, and ensuring reliability and efficiency for federal environments. While the company mentions AI as a driver for their identity solutions, this role is primarily focused on the underlying infrastructure and SRE practices, not direct AI/ML model development.

What you'd actually do

  1. Managing a team of SRE’s supporting our various workloads operating in private sector environments.
  2. Drive the microservice journey, DevOps maturity, and workload reliability in tandem with architects and teams across the organization.
  3. Accelerate the velocity of SRE and product engineering by developing powerful tooling, intuitive self-service capabilities, and robust self-healing patterns.
  4. Lead, mentor, and grow a high-performing team of engineers and managers across platform, infrastructure, and shared services domains.
  5. Perform engineering design evaluations and ensure the completion of projects within resource, budget, and scheduling constraints.

Skills

Required

  • 3+ years of experience in technical leadership & people management
  • Extensive experience using Agile and DevOps methodologies to build product infrastructure and shared service at scale
  • Experience running large-scale infrastructure platforms supporting a SaaS/Cloud service in a public Cloud, preferably AWS.
  • Strong expertise in cloud-native architectures, containerization (Kubernetes), IaC (Terraform), and CI/CD pipelines
  • Strong background and hands-on experience in SW development, PaaS and automation
  • Deep experience with building and operating observability platforms and monitoring tools (Grafana, Splunk, APM etc.) in a large scale environment.
  • Effective verbal, written communication and interpersonal skills
  • Computer Science Degree or related degree or equivalent experience
  • ability to access federal environments
  • access to protected federal data
  • U.S. Person status

Nice to have

  • Experience supporting a multi-Cloud environment will be a plus.

What the JD emphasized

  • This position requires the ability to access federal environments and/or have access to protected federal data.
  • successful candidate must be able to submit documentation establishing U.S. Person status