Senior Site Reliability Engineer (auth0)

Okta Okta · Enterprise · Spain · Remote · Tech Ops-610

Senior Site Reliability Engineer at Okta (Auth0) focused on ensuring the reliability, resiliency, and scalability of production systems. The role involves designing and building custom software in Go, partnering with engineering teams, and contributing to SRE tooling and processes. Requires strong experience in infrastructure as code, container orchestration, cloud providers, and SRE principles.

What you'd actually do

  1. Design and build custom software in Go to enhance the platform's reliability, resiliency, and redundancy.
  2. Partner with engineering teams to embed reliability principles, improving the availability, performance, and observability of our services.
  3. Use your deep understanding of infrastructure and observability principles to identify opportunities for improvement within the product and implement solutions.
  4. Contribute to our on-call rotation, providing rapid, effective response to critical incidents and using your expertise to troubleshoot, mitigate or accurately escalate production issues.
  5. Develop and refine our SRE tooling and processes, focusing on automation and operational efficiency.

Skills

Required

  • Go
  • Terraform
  • Kubernetes
  • Docker
  • ArgoCD
  • Azure, AWS, or GCP
  • microservices architecture
  • SQL
  • NoSQL
  • networking fundamentals
  • SLIs, SLOs, and error budgets
  • on-call rotation

Nice to have

  • custom applications
  • GitOps

What the JD emphasized

  • custom software in Go
  • reliability principles
  • observability
  • on-call rotation
  • SRE tooling and processes
  • automation
  • reliability best practices
  • software engineer's mindset
  • operational expertise
  • high degree of ownership
  • large-scale, mission-critical applications
  • high degree of autonomy
  • major cloud provider
  • microservices architecture
  • networking fundamentals
  • SLIs, SLOs, and error budgets
  • on-call rotation for a 24/7 cloud-based environment
  • remote, distributed team