Senior Staff Data Engineer, Marketplaces Dna

Airbnb Airbnb · Consumer · United States · Software Engineering

Senior Staff Data Engineer role focused on building and scaling data infrastructure and pipelines to power Airbnb's Guest and Host products, including machine learning models and data products. The role involves designing foundational data models, ensuring data quality, and optimizing data processing systems at petabyte scale.

What you'd actually do

  1. Develop and automate large scale, high-performance batch and streaming data processing systems to power Airbnb’s Guest and Host products, machine learning models, and business insights.
  2. Partner closely with infra teams to improve scalability, data governance, and efficiency
  3. Evangelize high quality software engineering practices towards building data infrastructure and pipelines at scale, collaborate with infrastructure teams to streamline best practices.
  4. Advocate for high bar for data and engineering quality ensure eng deliverables are reliable, efficient, well documented, testable, & maintainable.
  5. Design our data models for optimal storage and understanding, with thoughtful dataflows to power critical product and business requirements.

Skills

Required

  • 12+ years of relevant industry experience with a Bachelor’s and/or Master’s degree in CS/EE, or equivalent experience, or 9+ years of experience with a PhD
  • Experience collaborating with client, backend, ml, analytics teams, product and business partners
  • Experience designing and deploying high performance systems with reliable monitoring and logging practices.
  • Effectively work across team boundaries to establish overarching data architecture, data flow, and provide guidance to individual teams.
  • Strong knowledge of relational databases and query authoring (SQL).
  • Strong expertise with Java / Scala / Spark and operating on data at the petabyte scale.
  • Excellent communication skills, both written and verbal.

What the JD emphasized

  • petabyte scale

Other signals

  • ML models
  • data products
  • petabyte scale
  • batch and streaming data processing