Senior Cloud Data Infrastructure Engineer

ClickHouse ClickHouse · Data AI · Engineering

This role focuses on building and maintaining the auto-scaling infrastructure for ClickHouse Cloud, aiming to transform it into a serverless database solution. It involves improving the metrics pipeline, developing algorithms for autoscaling recommendations, and working with Kubernetes operators and distributed systems.

What you'd actually do

  1. Build a cutting-edge Cloud Native platform on top of the public cloud.
  2. Improve the metrics pipeline and build algorithms to generate better autoscaling statistics and recommendations.
  3. Work on the autoscale and Kubernetes operator to support seamless Vertical and Horizontal Auto-scaling.
  4. Work closely with our ClickHouse core development team and other data plane teams, partnering with them to support auto-scaling use cases as well as other internal infrastructure improvements.
  5. Architecting and building a robust, scalable, and highly available distributed infrastructure

Skills

Required

  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems.
  • Experience building operators with Kubernetes, controller runtime
  • Production experience with programming languages like Go, C++
  • Expertise with a public cloud provider (AWS, GCP, Azure) and their infrastructure as a service offering (e.g., EC2).
  • Experience with Data Storage, Ingestion, and Transformation (Spark, Kafka or similar tools).

Nice to have

  • Experience with Python (uv, rye, fastAPI)
  • Data Science (Pandas, NumPy etc)
  • You are not a stranger to PagerDuty On-call, debugging things in production and are a strong problem-solver
  • You are passionate about solving data problems at Scale.
  • You have excellent communication skills and the ability to work well within and across engineering teams