Senior Software Engineer, Data Compute

Robinhood Robinhood · Fintech · Bellevue, WA +2 · ENG Infrastructure

Senior Software Engineer on the Data Compute team responsible for building and evolving Robinhood's large-scale Spark and Airflow environments, including migrating workloads to Databricks and optimizing lakehouse fundamentals.

What you'd actually do

  1. Design and build scalable platform primitives for Spark and Airflow to support Robinhood’s global data infrastructure needs.
  2. Lead the migration and modernization of Spark workloads to serverless Databricks and Delta Lake architectures.
  3. Optimize compute resource utilization and efficiency to manage costs across large-scale distributed systems.
  4. Collaborate with internal teams across analytics and product engineering to deliver a seamless, self-serve data processing experience.
  5. Improve platform reliability and governance by implementing advanced metadata management and access controls via Unity Catalog and Trino.

Skills

Required

  • large-scale Spark
  • Databricks
  • Airflow
  • lakehouse fundamentals
  • Delta Lake
  • Parquet
  • Trino
  • Pinot
  • Hive Metastore

Nice to have

  • serverless patterns
  • Unity Catalog

What the JD emphasized

  • large-scale Spark and Databricks or similar platform infrastructure
  • data orchestration using Airflow
  • lakehouse fundamentals
  • query and serving infrastructure
  • multi-team platform reliability, including cost optimization and developer experience initiatives