Cloud Software Engineer - Observability Platform

ClickHouse · Data AI · United States · Engineering

A software engineering role building and operating a high-throughput telemetry platform for observability, spanning distributed systems, reliability, performance, and cost-efficiency. It requires experience with production systems, Golang, Kubernetes, and cloud providers, and emphasizes automation and incident response.

What you'd actually do

  1. Design, build, and operate distributed systems that power observability across ClickHouse Cloud
  2. Own reliability, performance, and cost-efficiency of our telemetry pipeline and storage systems
  3. Take part in the on-call rotation and help drive root-cause resolution and long-term fixes
  4. Build tooling and automation to eliminate repetitive operational work
  5. Help shape the roadmap for observability by identifying bottlenecks and scaling challenges

Skills

Required

  • Golang
  • Kubernetes
  • Helm
  • ArgoCD
  • Terraform
  • AWS
  • GCP
  • Azure
  • OpenTelemetry
  • Prometheus
  • Grafana

Nice to have

  • ClickHouse

What the JD emphasized

  • 5+ years building and running production systems at scale