Senior Infrastructure Engineer - Postgres

ClickHouse ClickHouse · Data AI · Engineering

Senior Infrastructure Engineer responsible for reliability, automation, and operations of ClickHouse's cloud data platform, focusing on Postgres integration across AWS, GCP, and Azure. The role involves designing and implementing infrastructure-as-code, developing Go-based tooling, owning observability, and driving incident management.

What you'd actually do

  1. Lead reliability and operations for ClickHouse’s Postgres integration — upgrades, patching, maintenance, and scaling.
  2. Design and implement automation for provisioning, deployments, and service lifecycle management across AWS, GCP, and Azure.
  3. Develop infrastructure-as-code using Terraform and modern CI/CD tooling to ensure consistent, repeatable deployments.
  4. Contribute Go-based tooling and services that improve automation, observability, and developer experience.
  5. Own observability and monitoring, ensuring robust alerting, metrics, and tracing across environments.

Skills

Required

  • SRE
  • DevOps
  • Infrastructure Engineering
  • Postgres operations
  • AWS
  • GCP
  • Azure
  • Terraform
  • Kubernetes
  • Go development
  • Prometheus
  • Grafana
  • Loki
  • OpenTelemetry
  • SLOs
  • Incident Response

Nice to have

  • multi-cloud topologies
  • container-based infrastructure
  • developer experience
  • service operability

What the JD emphasized

  • 7+ years in SRE, DevOps, or infrastructure engineering
  • Solid understanding of Postgres operations, scaling, and performance tuning
  • Deep hands-on experience across AWS, with exposure to GCP and Azure
  • Proficient with Terraform, Kubernetes, and container-based infrastructure
  • Strong Go development skills (or willingness to write and own production Go code)
  • Familiar with tools like Prometheus, Grafana, Loki, OpenTelemetry, or equivalents
  • Deep understanding of SLOs, incident response, and continuous improvement in service reliability
  • founder’s mentality
  • ownership, reliability, and speed are core values