Site Reliability Engineer (mid / Senior) - Platform Infrastructure

Elastic Elastic · Enterprise · Spain · Platform - SRE

This role is for a Site Reliability Engineer on the Platform Infrastructure team at Elastic, a company that leverages AI in its search platform. The engineer will design and develop tooling for building, testing, and shipping the Elastic Stack, and build/operate production services. They will need strong software development and SRE experience, proficiency in Linux systems administration, and experience with infrastructure-as-code tools.

What you'd actually do

  1. Design and develop tooling that facilitates building, testing, and shipping the Elastic Stack
  2. Build and operate production services that power core aspects of the Elastic business, including downloads, Docker registry, maps service, and more
  3. Support internal adoption of the Elastic Stack for software development and analytics use cases

Skills

Required

  • Software Developer
  • Python
  • JavaScript
  • Clojure
  • Haskell
  • Java
  • Go
  • Site-Reliability Engineering
  • Linux systems administration
  • SaaS platform operation
  • Infrastructure-as-code
  • Docker
  • Terraform
  • Kubernetes

Nice to have

  • Automate and monitor everything
  • Building reusable software components
  • Open source contributions
  • Versioned, Git-based workflow
  • Strong Linux fundamentals
  • syscall tracing
  • TCP internals
  • init systems
  • Open source
  • Distributed, asynchronous work environment
  • Written communication
  • Diverse, globally distributed teams
  • Collaborative, inclusive approach