Senior Software Engineer (infrastructure) - Hyperdx

ClickHouse ClickHouse · Data AI · Engineering

This role is for a Senior Software Engineer focused on building and scaling the backend infrastructure for HyperDX, an observability platform. The responsibilities include designing and implementing APIs, scaling cloud-native systems using Kubernetes and IaC, ensuring operational excellence through CI/CD and monitoring, and engineering high-throughput data pipelines. While the company mentions 'AI workloads' in its general description, the core of this role is focused on infrastructure and backend engineering for an observability tool, not directly building or deploying AI models.

What you'd actually do

  1. Design and implement backend systems and APIs that power HyperDX, enabling engineers to ingest, query, and analyze observability data at massive scale.
  2. Architect, deploy, and maintain cloud-native systems that ensure reliability, scalability, and performance. You’ll use Kubernetes, Helm, and infrastructure-as-code to make deployments simple and resilient.
  3. Define best practices for CI/CD, monitoring, logging, and alerting. Drive automation across testing, scaling, and incident response to keep our platform healthy and developer-friendly.
  4. Design and operate ingestion and data processing pipelines that remain performant, resilient, and observable—even as we grow to petabyte-level workloads.
  5. Collaborate with open-source contributors and customers, solve their challenges, and incorporate their feedback into our roadmap.

Skills

Required

  • backend engineering experience
  • TypeScript
  • Node.js
  • APIs
  • event-driven systems
  • high-throughput data pipelines
  • SQL
  • Docker
  • Kubernetes
  • Helm
  • infrastructure-as-code
  • CI/CD pipelines
  • monitoring systems
  • production-grade alerting practices
  • cloud-native systems

Nice to have

  • ClickHouse experience
  • distributed systems
  • ingestion pipelines
  • columnar databases
  • observability tools
  • multi-tenant SaaS platforms
  • AWS
  • GCP
  • Azure
  • service meshes
  • networking
  • advanced Kubernetes features
  • developer tooling
  • open-source contributions
  • SDKs
  • ReactJS
  • frontend development