Site Reliability Engineer (Mid / Senior) - Platform Infrastructure

Elastic · Enterprise · Spain · Platform - SRE

This role is for a Site Reliability Engineer on the Platform Infrastructure team at Elastic. The team designs and develops tooling for building, testing, and shipping the Elastic Stack, and operates production services critical to the business. The role requires a strong software development background, SRE experience, Linux administration skills, and proficiency in infrastructure-as-code tools.

What you'd actually do

  1. Design and develop tooling that facilitates building, testing, and shipping the Elastic Stack
  2. Build and operate production services that power core aspects of the Elastic business, including downloads, Docker registry, maps service, and more
  3. Support internal adoption of the Elastic Stack for software development and analytics use cases

Skills

Required

  • Software development
  • Python
  • JavaScript
  • Clojure
  • Haskell
  • Java
  • Go
  • Site reliability engineering
  • Linux systems administration
  • SaaS platform operation
  • Infrastructure-as-code
  • Docker
  • Terraform
  • Kubernetes

Nice to have

  • Automate and monitor everything
  • Building reusable software components
  • Open source contributions
  • Versioned, Git-based workflow
  • Linux fundamentals
  • syscall tracing
  • TCP internals
  • init systems
  • Open source
  • Distributed, asynchronous work environment
  • Written communication
  • Diverse, globally distributed teams
  • Collaborative, inclusive approach