Senior Software Engineer - Database Integrations

ClickHouse · Data AI · Engineering

Senior Software Engineer role focused on building and optimizing a high-performance data onboarding platform at petabyte scale. Responsibilities include designing and developing integrations with diverse data sources, data stores, and object stores, leveraging ClickHouse's performance, and collaborating across teams. Requires 5+ years of experience building high-scale, data-intensive software solutions, proficiency in Golang or Java, strong data engineering skills (ETL, CDC), cloud-native proficiency, and Kubernetes experience.

What you'd actually do

  1. Design and develop a high-performance data onboarding platform capable of handling petabytes of data in real time.
  2. Build robust integrations with a variety of data sources (e.g., MySQL, Mongo, Iceberg), data stores (e.g., Snowflake, Redshift, ClickHouse), and object stores (e.g., S3, GCS).
  3. Create solutions that enable users to fully harness ClickHouse’s exceptional performance and throughput.
  4. Work closely with internal teams to ensure the platform aligns with customer needs and business objectives.
  5. Lead and influence technical discussions, continuously identifying and implementing improvements.

Skills

Required

  • 5+ years of industry experience building high-scale, data-intensive software solutions
  • Proficient in Golang or Java
  • Deep experience in distributed systems and microservices architecture
  • Strong background in designing and implementing robust ETL pipelines
  • Understanding of data replication methodologies such as CDC
  • Solid experience with cloud-native architecture and infrastructure
  • Hands-on knowledge of at least one major cloud service provider (CSP)
  • Practical experience with Kubernetes (K8s)
  • Exceptional production debugging skills
  • Ability to navigate and solve complex technical issues in fast-paced environments
  • High autonomy
  • Founder's mindset
  • Excellent communication skills
  • Track record of working effectively across teams

What the JD emphasized

  • high-scale
  • data-intensive
  • high-performance
  • petabytes of data
  • real-time
  • robust integrations
  • exceptional performance
  • high-velocity
  • significant autonomy
  • founder's mindset
  • measurable impact