Sr Distributed Systems Engineer (DevOps)

Workday · Enterprise · Auckland, New Zealand

This role is for a Senior Distributed Systems Engineer (DevOps) at Workday, an AI platform company. The engineer will focus on designing and automating cloud infrastructure (AWS, GCP), managing CI/CD pipelines, and ensuring high availability and performance of distributed systems. The role involves collaborating with engineering teams, securing infrastructure, and building monitoring systems. While the company develops AI technology, this specific role is in DevOps/Infrastructure, not directly building AI models or agents.

What you'd actually do

  1. Design infrastructure and automated systems to support our distributed architecture
  2. Build and manage CI/CD pipelines, continually improving their reliability and speed and reducing lead time for changes
  3. Trace performance bottlenecks and identify optimizations and improvements at both the infrastructure and application level
  4. Collaborate with our engineering team to meet high SLO and SLA requirements from customers
  5. Maintain highly available web and backend systems that serve millions of users and thousands of requests per second

Skills

Required

  • DevOps experience
  • database administration experience
  • orchestrating large scale distributed microservice deployments on Kubernetes and EC2
  • building and managing EKS clusters
  • AWS ecosystem expertise
  • Terraform
  • Python
  • TypeScript
  • Go
  • Bash
  • Kubernetes
  • Docker
  • Istio
  • Linkerd
  • PostgreSQL
  • DynamoDB
  • Redis
  • Kafka
  • RabbitMQ
  • SQS
  • Prometheus
  • Grafana
  • ELK
  • OpenSearch
  • GitHub CI
  • GitLab CI
  • networking
  • application-layer logic
  • Infrastructure as Code (IaC)

Nice to have

  • BS in Computer Science (or a related field) or equivalent practical experience in large-scale systems
  • Service Mesh
  • message queuing clusters
  • monitoring and logging stacks
  • security-centric pipelines
  • low-level networking
  • application-layer logic
  • security within Infrastructure as Code (IaC)
  • container builds
  • microservice deployments

What the JD emphasized

  • 7+ years DevOps experience
  • 5+ years database administration experience (Postgres, MariaDB, MSSQL)
  • 4+ years experience orchestrating large scale distributed microservice deployments on Kubernetes and EC2
  • 4+ years experience building and managing EKS clusters and strong knowledge of the K8s ecosystem