Cloud Software Engineer

ClickHouse ClickHouse · Data AI · Product & Engineering

ClickHouse is seeking an experienced Cloud Software Engineer to join their Observability team. This role involves designing, building, and operating a high-throughput telemetry platform that powers internal monitoring and customer-facing observability features. The engineer will be responsible for the reliability, performance, and cost-efficiency of the pipeline and storage systems, participating in on-call rotations, and developing automation to reduce operational work. The position requires strong production debugging skills, experience with distributed systems, cloud providers, Kubernetes, and observability tools like OpenTelemetry and Prometheus.

What you'd actually do

  1. Design, build, and operate distributed systems that power observability across ClickHouse Cloud
  2. Own reliability, performance, and cost-efficiency of our telemetry pipeline and storage systems
  3. Take part in the on-call rotation and help drive root-cause resolution and long-term fixes
  4. Build tooling and automation to eliminate repetitive operational work
  5. Help shape the roadmap for observability by identifying bottlenecks and scaling challenges

Skills

Required

  • 5+ years building and running production systems at scale
  • Proficiency in at least one systems-level language (Go, C++, Rust, Python)
  • Experience with Kubernetes, Helm, ArgoCD, and Terraform or similar IaC tools
  • Comfortable working with at least one major cloud provider (AWS, GCP, Azure)
  • Familiarity with OpenTelemetry, Prometheus, Grafana, or similar tools
  • Experience with ClickHouse

Nice to have

  • SRE
  • Systems Engineer
  • DevOps

What the JD emphasized

  • 5+ years building and running production systems at scale
  • Strong bias for action and ownership
  • Great production debugging skills
  • problem-solving mindset