Senior Software Engineer - Database Engine Internals

Databricks Databricks · Data AI · San Francisco, CA · Engineering - Pipeline

Databricks is seeking a Senior Software Engineer to work on their database engine internals, focusing on building the next generation of their Lakehouse platform. This role involves designing and implementing systems for query compilation, optimization, distributed execution, storage structures, and more, aiming to outperform specialized data warehouses and support diverse workloads from ETL to data science.

What you'd actually do

  1. Query compilation and optimization
  2. Distributed query execution and scheduling
  3. Vectorized execution engine
  4. Data security
  5. Resource management
  6. Transaction coordination
  7. Efficient storage structures (encodings, indexes)
  8. Automatic physical data optimization

Skills

Required

  • database systems
  • storage systems
  • distributed systems
  • language design
  • performance optimization
  • 5+ years of experience working in a related system

Nice to have

  • PhD in databases or distributed systems