Staff Site Reliability Engineer - Kubernetes

Okta · Enterprise · United States · Tech Ops-610

Okta is seeking a Staff Site Reliability Engineer to build and manage Kubernetes platforms on AWS, focusing on reliability, scalability, security, cost optimization, and automation. The role involves hands-on work with AWS infrastructure, Kubernetes, Helm, Karpenter, and the Istio service mesh, as well as CI/CD pipelines and incident management.

What you'd actually do

  1. Kubernetes Platform Creation: Design, implement, and maintain highly available, scalable, and fault-tolerant Kubernetes platforms. Ensure clusters are optimized for production workloads, providing high resilience and operational efficiency.
  2. AWS Infrastructure Management: Build, manage, and optimize AWS cloud infrastructure, including EKS, ECS, S3, VPCs, RDS, IAM, and more. Implement best practices for cost management, scaling, and security within AWS.
  3. Helm Management: Utilize Helm to automate and streamline the deployment of applications and services to Kubernetes clusters. Create, maintain, and manage Helm charts for production-ready deployments.
  4. Karpenter Implementation: Implement and manage Karpenter to dynamically scale Kubernetes clusters in response to workload demands.
  5. Istio Service Mesh Management: Configure and manage Istio to provide service-to-service communication, security, and observability within the Kubernetes clusters. Enable fine-grained traffic management, service discovery, and policy enforcement.

Skills

Required

  • Kubernetes/Helm
  • Terraform
  • AWS
  • multi-region cloud environments
  • cloud-native architectures
  • Kubernetes platform creation, management, and optimization
  • Helm for Kubernetes application deployment and management
  • Karpenter for dynamic scaling of Kubernetes clusters
  • Istio for service mesh
  • CI/CD pipelines and automation tools
  • scripting and automation skills in Python, Bash, or Go
  • monitoring, logging, and alerting tools

Nice to have

  • security best practices for cloud platforms and Kubernetes
  • Docker and containerization principles
  • Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent professional experience)
  • CKA (Certified Kubernetes Administrator), CKAD (Certified Kubernetes Application Developer), or AWS Certified DevOps Engineer

What the JD emphasized

  • requires the ability to access federal environments and/or have access to protected federal data
  • submit documentation establishing U.S. Person status