Senior Software Engineer - Ingestion

Databricks Databricks · Data AI · Bangalore, India · Engineering - Pipeline

Databricks is seeking a Senior Software Engineer for their Lakeflow Connect team, focusing on data ingestion into the Lakehouse. The role involves building scalable, fault-tolerant systems for extracting data from various sources, including OLTP systems, using techniques like incremental data capture and log parsing. The engineer will work on core database internals and large-scale distributed systems, contributing to architecture design, roadmap influence, and technical project leadership.

What you'd actually do

  1. Solve real business needs at large scale by applying your software engineering.
  2. Deliver a highly scalable, available, and fault-tolerant engine processing hundreds of TB of data daily across thousands of customers
  3. Low level systems debugging, performance measurement & optimization on large production clusters.
  4. Build architecture design, influence product roadmap, and take ownership and responsibility over new projects
  5. Use your deep experience to help prevent and investigate production issues.

Skills

Required

  • BS (or higher) in Computer Science, or a related field
  • 6+ years of production level experience in one of: Python, Java, Scala, C++, or similar language.
  • Experience developing large-scale distributed systems from scratch
  • Hands-on experience in developing and operating backend systems

Nice to have

  • Experience in areas like Database replication, backup, transaction recovery at one of the major database vendors (Microsoft SQL Server , Oracle, IBM etc) is a plus to have.

What the JD emphasized

  • 6+ years
  • large-scale distributed systems
  • Hands-on experience in developing and operating backend systems