Senior Cloud Performance Engineer - Remote

ClickHouse · Data AI · Cloud Engineering

ClickHouse is seeking a Senior Cloud Performance Engineer to build and optimize its cloud-native ClickHouse Cloud platform. The role involves benchmarking, performance analysis, capacity sizing, troubleshooting, and driving chaos engineering initiatives for large-scale distributed systems. The ideal candidate has extensive experience in distributed systems performance, database benchmarking, test automation, and cloud infrastructure.

What you'd actually do

  1. Benchmark system performance and carry out database performance analysis, capacity sizing, and optimization.
  2. Troubleshoot and debug applications and server errors, analyze logs, and triage issues accordingly.
  3. Recommend configuration tuning/optimizations for performance bottlenecks.
  4. Partner closely with ClickHouse's core development team, cloud team, and security team to improve the performance of ClickHouse Cloud.
  5. Plan, enable, and drive chaos engineering initiatives across engineering teams, based on internal priorities.

Skills

Required

  • 6+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems.
  • Software development experience in Go, C/C++, Java, or similar.
  • Experience with concurrency, multithreading, and the deployment of distributed system architectures.
  • Experience developing cloud infrastructure services, preferably with Kubernetes.
  • Experience leading and shipping large-scope technical projects in collaboration with multiple experienced engineers.
  • Expertise with a public cloud provider (AWS, GCP, Azure) and its infrastructure-as-a-service offering (e.g., EC2).
  • Excellent communication skills.
  • Strong problem-solving skills.
  • Solid production debugging skills.

Nice to have

  • Passion for efficiency, availability, scalability, and data governance
  • Ability to thrive in a fast-paced environment
  • High level of responsibility, ownership, and accountability

What the JD emphasized

  • proven track record of understanding the performance limits of different distributed databases
  • strong background in database benchmarking
  • test automation
  • system engineering
  • performance analysis
  • capacity management
  • large-scope technical projects