Staff Site Reliability Engineer

Fivetran Fivetran · Data AI · Novi Sad, Serbia · Engineering Department

Fivetran is seeking a Staff Site Reliability Engineer to ensure the performance, reliability, and scalability of their data platform infrastructure. The role involves collaborating with engineering teams, managing Kubernetes and cloud platforms, and implementing automation for deployments and incident response.

What you'd actually do

  1. Responsible for the ongoing reliability and robustness of Fivetran’s production infrastructure by monitoring availability, capacity, and throughput.
  2. Collaborate with engineering teams to integrate reliability best practices into the product roadmap
  3. Support the prioritization and resolution of critical bugs identified by support or sales.
  4. Contribute to maintaining the high reliability and availability of production infrastructure by collaborating with engineering to implement automation for scalable deployments
  5. Ensure scalable artifacts deployment to all environments through automation scripts.

Skills

Required

  • SaaS platforms at scale
  • Managed Kubernetes (EKS, AKS, and GKE)
  • Cloud Platforms and related tooling: AWS, Azure, GCP, Terraform, Ansible, Buildkite, Pulumi, and ArgoCD
  • Python
  • Shell scripting
  • Go
  • Linux operating systems, internals, and administration
  • cloud networking like Managed NAT Gateways, VPNs, Privatelinks, and Private Service Connect (GCP)
  • PostgreSQL

Nice to have

  • Java

What the JD emphasized

  • 7+ years of experience working with SaaS platforms at scale
  • Expertise in managed Kubernetes (EKS, AKS, and GKE)