Senior Software Engineer, Data Engineering

Robinhood · Fintech · Menlo Park, CA +2 · ENG Data Science

Senior Data Engineer at Robinhood responsible for building and maintaining foundational datasets, designing scalable data pipelines, and collaborating with upstream engineering teams and data consumers to ensure data availability and quality for product analytics, growth strategy, financial reporting, and machine learning.

What you'd actually do

  1. Help define and build key datasets across all Robinhood product areas. Lead the evolution of these datasets as use cases grow.
  2. Build scalable data pipelines using Python, Spark and Airflow to move data from different applications into our data lake.
  3. Partner with upstream engineering teams to enhance data generation patterns.
  4. Partner with data consumers across Robinhood to understand consumption patterns and design intuitive data models.
  5. Propose and contribute to shared data engineering tooling and standards.

Skills

Required

  • Python
  • Spark
  • Airflow
  • SQL
  • Data Pipelines
  • Data Modeling
  • Software Engineering

Nice to have

  • Flink
  • Presto
  • Spark SQL

What the JD emphasized

  • 5+ years of professional experience building end-to-end data pipelines
  • Hands-on software engineering experience, with the ability to write production-level code in Python for user-facing applications, services, or systems (not just data scripting or automation)
  • Expert at building and maintaining large-scale data pipelines using open-source frameworks (Spark, Flink, etc.)
  • Strong SQL skills (Presto, Spark SQL, etc.)