Staff Software Engineer - Database Engine Internals

Databricks · Data AI · San Francisco, CA · Engineering - Pipeline

Databricks is seeking a Staff Software Engineer to work on core database engine internals, building next-generation query engines and structured storage systems. The role involves designing and implementing systems for query compilation, optimization, distributed execution, vectorized execution, data security, resource management, transaction coordination, and efficient storage structures. These systems support diverse workloads, from ETL to data science, within the company's unified data and AI platform.

What you'd actually do

  1. Query compilation and optimization
  2. Distributed query execution and scheduling
  3. Vectorized execution engine
  4. Data security
  5. Resource management
  6. Transaction coordination
  7. Efficient storage structures (encodings, indexes)
  8. Automatic physical data optimization

Skills

Required

  • database systems
  • storage systems
  • distributed systems
  • language design
  • performance optimization
  • multi-year vision
  • incremental deliverables
  • customer value and impact

Nice to have

  • PhD in databases or distributed systems

What the JD emphasized

  • radically simplify the whole data lifecycle
  • unified platform
  • next generation (decoupled) query engine
  • structured storage system
  • leapfrog state-of-the-art