Senior Software Engineer (infrastructure) - Clickstack

ClickHouse ClickHouse · Data AI · United States · Engineering

Senior Software Engineer (Infrastructure) at ClickHouse, focusing on building and scaling a petabyte-scale observability platform. The role involves designing and implementing backend systems, APIs, and cloud-native infrastructure using Kubernetes, Helm, and IaC. Responsibilities include ensuring reliability, scalability, performance, and operational excellence through CI/CD, monitoring, and automation. The role also involves engineering for scale in ingestion and data processing pipelines and engaging with the open-source community.

What you'd actually do

  1. Design and implement backend systems and APIs that power ClickStack, enabling engineers to ingest, query, and analyze observability data at massive scale.
  2. Architect, deploy, and maintain cloud-native systems that ensure reliability, scalability, and performance. You’ll use Kubernetes, Helm, and infrastructure-as-code to make deployments simple and resilient.
  3. Define best practices for CI/CD, monitoring, logging, and alerting. Drive automation across testing, scaling, and incident response to keep our platform healthy and developer-friendly.
  4. Design and operate ingestion and data processing pipelines that remain performant, resilient, and observable—even as we grow to petabyte-level workloads.
  5. Collaborate with open-source contributors and customers, solve their challenges, and incorporate their feedback into our roadmap.

Skills

Required

  • 5+ years of backend engineering experience
  • Strong TypeScript and Node.js skills
  • Deep understanding of APIs, event-driven systems, and high-throughput data pipelines
  • Proficiency in SQL and experience working with analytical databases
  • Experience with Docker and Kubernetes
  • Experience with Helm for managing production deployments
  • Experience with infrastructure-as-code (Terraform, Pulumi, or similar)
  • Familiarity with CI/CD pipelines, monitoring systems, and production-grade alerting practices

Nice to have

  • ClickHouse experience
  • Expertise in distributed systems, ingestion pipelines, and columnar databases
  • Strong opinions on observability tools and a vision for making them 10x better
  • Experience scaling multi-tenant SaaS platforms in AWS, GCP, or Azure
  • Knowledge of service meshes, networking, and advanced Kubernetes features
  • Background in developer tooling or open-source contributions
  • Experience building SDKs or integrations in multiple languages
  • Familiarity with ReactJS and frontend development