Senior Cloud Data Infrastructure Engineer

ClickHouse ClickHouse · Data AI · Product & Engineering

This role focuses on building and maintaining cloud-native infrastructure for ClickHouse, specifically developing auto-scaling capabilities and improving the metrics pipeline. It involves working with Kubernetes operators, distributed systems, and public cloud providers to evolve ClickHouse into a serverless and cloud-native database solution.

What you'd actually do

  1. Build a cutting-edge Cloud Native platform on top of the public cloud.
  2. Improve the metrics pipeline and build algorithms to generate better autoscaling statistics and recommendations.
  3. Work on the autoscale and Kubernetes operator to support seamless Vertical and Horizontal Auto-scaling.
  4. Work closely with our ClickHouse core development team and other data plane teams, partnering with them to support auto-scaling use cases as well as other internal infrastructure improvements.
  5. Architecting and building a robust, scalable, and highly available distributed infrastructure

Skills

Required

  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems.
  • Experience building operators with Kubernetes, controller runtime
  • Production experience with programming languages like Go, C++
  • PagerDuty On-call, debugging things in production
  • Expertise with a public cloud provider (AWS, GCP, Azure) and their infrastructure as a service offering (e.g., EC2).
  • Data Storage, Ingestion, and Transformation (Spark, Kafka or similar tools).

Nice to have

  • Python (uv, rye, fastAPI)
  • Data Science (Pandas, NumPy etc)

What the JD emphasized

  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems.
  • Experience building operators with Kubernetes, controller runtime
  • Production experience with programming languages like Go, C++
  • Expertise with a public cloud provider (AWS, GCP, Azure) and their infrastructure as a service offering (e.g., EC2).