Intermediate Site Reliability Engineer, Environment Automation

GitLab GitLab · Enterprise · India · Platforms Engineering

GitLab is seeking an Intermediate Site Reliability Engineer for their Environment Automation team. This role focuses on ensuring the reliability, scalability, security, and consistency of hundreds of isolated GitLab environments for customers. The engineer will work with infrastructure as code, automation, and observability to manage these environments across cloud providers and Kubernetes, directly impacting customer experience.

What you'd actually do

  1. Contribute to automating operational tasks across many GitLab environments, from initial provisioning and configuration updates to upgrades and routine maintenance, helping reduce manual work and improve reliability at scale under the guidance of senior team members.
  2. Help build and refine the observability stack for multi-tenant GitLab environments so we monitor the right signals across Kubernetes, cloud services, and GitLab applications, supporting early issue detection and basic capacity tracking.
  3. Assist in responding to platform alerts and incidents, collaborating with Environment Automation SREs and engineering teams to troubleshoot production issues across multiple tenants and document findings.
  4. Support planning and implementation of infrastructure changes, capacity expansions, and new service rollouts for Dedicated and other managed GitLab environments, contributing to efforts that improve resource efficiency and environment isolation.
  5. Develop and maintain scripts, automation tools, and infrastructure-as-code workflows that manage parts of the GitLab environment lifecycle, enabling more repeatable, self-service operations over time.

Skills

Required

  • Golang
  • Kubernetes
  • infrastructure automation

Nice to have

  • Terraform
  • Ansible
  • Helm Charts
  • omnibus-gitlab configurations

What the JD emphasized

  • required
  • required
  • required