Senior Site Reliability Engineer

Fivetran Fivetran · Data AI · Oakland, CA · Engineering Department

Fivetran is seeking a Senior Site Reliability Engineer to join their team. The role involves monitoring, alerting, incident response, and supporting the deployment pipeline for Fivetran's infrastructure. The engineer will collaborate with other teams to integrate reliability best practices, resolve critical bugs, and ensure 100% availability of production infrastructure through automation. The position requires experience with SaaS products at scale, Kubernetes administration, cloud platforms (AWS, GCP), Terraform, Python/Shell scripting, Linux internals, and databases like PostgreSQL.

What you'd actually do

  1. monitoring the availability, capacity, and throughput of Fivetran's production infrastructure to identify and address potential issues.
  2. Collaborate with engineering teams to integrate reliability best practices into the product roadmap
  3. Support the prioritization and resolution of critical bugs identified by support or sales.
  4. Contribute to maintaining 100% availability of production infrastructure by collaborating with engineering to implement automation for scalable deployments
  5. Proactively monitor infrastructure vulnerabilities and collaborate with the security team to address them in a timely manner.

Skills

Required

  • 5+ years of experience working with SaaS products at scale
  • Working experience of Kubernetes administration
  • Knowledge of Cloud Platforms and related tooling: AWS, GCP, Terraform, configuration management
  • Experience in Python/Shell scripting
  • Experience with Linux operating systems internals and administration
  • Experience with databases such as PostgreSQL

Nice to have

  • Java