ClickHouse Operations Engineer

PostHog · Enterprise · Remote · Engineering

PostHog is seeking a ClickHouse Operations Engineer to automate, manage, and maintain their ClickHouse infrastructure, which is critical for ingesting, storing, and querying data. The role involves scaling infrastructure, optimizing performance, building systems for dynamic provisioning of large ClickHouse clusters (over 100 PB), and working on a query optimizer and performance tooling. Experience with OLAP databases, Terraform, Ansible, Kubernetes, and large-scale data solutions is required.

What you'd actually do

  1. automate, manage, and maintain ClickHouse as we grow towards capturing trillions of events per year and having one of the world’s largest clusters.
  2. ensure that we have the right hardware deployed at the right time for each workload on ClickHouse.
  3. build systems and automations for the provisioning and scaling of our large ClickHouse clusters, handling over 100 PB of data.
  4. investigate and experiment using the latest hardware that cloud providers have to offer in order to find the optimal setup for our solution.
  5. work on a bleeding-edge ClickHouse implementation, including features like open-format-backed tables, and not just maintenance.
  6. build a query optimizer for ClickHouse, which means you will also work on query performance tooling.

Skills

Required

  • OLAP database experience (internals of ClickHouse or other OLAP databases)
  • Automating the dynamic provisioning of instances
  • Terraform
  • Ansible
  • Kubernetes
  • Experience with scale and complexity in data storage solutions
  • Python
  • AWS
  • ZooKeeper (or alternative)

What the JD emphasized

  • ClickHouse operations and scaling infrastructure
  • node and instance-level performance optimization
  • automating the dynamic provisioning of instances
  • query optimizer for ClickHouse
  • query performance tooling