Site Reliability Engineer, Customer Systems

Apple · Big Tech · Sunnyvale, CA +1 · Software and Services

Site Reliability Engineer role focused on designing, building, and delivering scalable, reliable, and secure cloud infrastructure for customer-facing applications and services. Responsibilities include infrastructure as code, automation, intelligent monitoring, and collaborating with cross-functional teams. Requires experience with Kubernetes, scripting (Shell, Python, Ansible), monitoring tools, and networking protocols.

What you'd actually do

  1. Innovate, architect, build, and document highly available, scalable, reliable, and secure infrastructure
  2. Troubleshoot application-specific, network, system, and performance issues
  3. Build and maintain CI/CD infrastructure to enable fast delivery cycles for software engineering teams
  4. Envision and build automation tools to deliver infrastructure services reliably and in a repeatable fashion
  5. Collaborate with other site reliability engineers, software engineers, and quality engineers to gather, define, and analyze non-functional and technical requirements

Skills

Required

  • Kubernetes
  • Helm
  • Shell Scripting
  • Python
  • Ansible
  • Splunk
  • Grafana
  • Prometheus
  • Alertmanager
  • DNS
  • TCP
  • HTTP/HTTPS
  • CI/CD pipelines
  • Computer Science

Nice to have

  • Java applications
  • ArgoCD
  • GitOps
  • MTTR
  • SLO
  • GenAI tools
  • workflow automation
  • infrastructure management
  • problem solving
  • critical thinking
  • interpersonal skills
  • communication skills