Staff Data Engineer, Ads

Discord Discord · Consumer · Remote · Data Science & Engineering

Staff Data Engineer for Discord's Ads team, focusing on building and maintaining data pipelines, datasets, and analytical tools for advertising products. This role will drive technical vision and strategy for ads data infrastructure, including feature pipelines, label generation, and training data systems for ML models. The engineer will also develop data quality frameworks and collaborate with data scientists and ML engineers.

What you'd actually do

  1. Create and maintain complex, enterprise-scale data pipelines and foundational datasets while defining technical strategy and architectural direction for advertising products
  2. Design and build sophisticated ETL processes, data models, and analytical frameworks using SQL, Python, and modern data stack technologies
  3. Build and maintain the data infrastructure that powers Ads ML - feature pipelines, label generation workflows, and training data systems that enable our ranking and delivery models
  4. Develop data quality frameworks, monitoring systems, automated anomaly detection, and alerting infrastructure that operates at massive scale
  5. Collaborate with data scientists, ML engineers, and product teams to identify high-impact data infrastructure opportunities, owning design through implementation

Skills

Required

  • 7+ years of hands-on experience writing production code and architecting data pipelines with high-volume consumer data in advertising technology domains (eg. ad delivery, ranking, targeting, identity)
  • 7+ years of direct implementation experience designing, coding, and maintaining complex data models and systems handling structured and unstructured data sources
  • Expert-level coding abilities in SQL, Python, and modern data engineering frameworks with demonstrated ability to write performant, maintainable, and scalable code
  • Digital advertising data engineering expertise with hands-on experience building high-throughput data pipelines for ad serving, conversion tracking, advertising measurement, or integrating and normalizing third-party advertising data from external platforms and partners
  • Proven hands-on experience implementing and debugging data quality audits, monitoring systems, and automated remediation for massive datasets (billions+ rows)
  • Strong technical communication abilities to explain complex implementations to stakeholders while thriving in rapidly-evolving technical environments
  • Hands-on collaboration experience implementing solutions with data science, ML engineering, and product teams through direct technical contribution

Nice to have

  • Passion for Discord or gaming in general
  • Hands-on integration experience implementing connections with external data sources, APIs, and third-party advertising platforms
  • Experience with modern data storage and processing technologies (BigQuery SQL, Airflow, Dagster, DBT, or similar)
  • Experience with data visualization and dashboarding technologies (Looker, Tableau, or similar)
  • Experience with designing data architecture to power a variety of use cases, including experimentation

What the JD emphasized

  • advertising technology domains
  • high-volume consumer data
  • structured and unstructured data sources
  • massive datasets (billions+ rows)

Other signals

  • data pipelines
  • feature stores
  • ML data engineering
  • training data systems