Senior Sre, Infrastructure & Platform

F5 F5 · Enterprise · Singapore

Senior Site Reliability Engineer focused on building automation, tooling, and internal platforms for a global, multi-datacenter infrastructure. The role involves developing internal tools and APIs in Python and Go, designing and maintaining Ansible automation, building CI/CD pipelines, and creating self-service interfaces within a PCI-DSS compliant environment. Key responsibilities include infrastructure as code, Kubernetes automation, cloud infrastructure automation, observability, and security/compliance automation.

What you'd actually do

  1. Design and develop internal tools, CLIs, and APIs (primarily in Go and Python) that enable infrastructure self-service, automate complex workflows, and improve operational efficiency
  2. Architect, develop, and refactor Ansible roles and playbooks across a large-scale inventory spanning 30+ datacenters, 80+ group variable files, and 40+ roles
  3. Design, write, and maintain GitLab CI pipelines for infrastructure automation, including multi-stage deployment workflows with linting, validation, canary testing, and regional rollout
  4. Develop automation for self-hosted Kubernetes cluster lifecycle management: provisioning, upgrades, scaling, and disaster recovery
  5. Automate PCI-DSS compliance workflows including CIS benchmark hardening, audit evidence collection, and configuration drift detection

Skills

Required

  • Python
  • Go
  • Ansible
  • Kubernetes
  • CI/CD
  • Site Reliability Engineering
  • Infrastructure as Code
  • API development
  • CLI development
  • GitLab CI
  • AWS
  • Azure
  • HashiCorp Vault
  • NetBox
  • Proxmox
  • iLO Redfish

Nice to have

  • Flask/FastAPI
  • custom Ansible modules/plugins
  • Kubernetes operators/controllers
  • Helm charts
  • GitOps
  • observability tools
  • security scanning integration

What the JD emphasized

  • PCI-DSS compliant environment
  • 5+ years of experience in an SRE, DevOps, or Infrastructure Engineering role with a strong emphasis on writing code and building automation
  • Expert-level Ansible skills