Data Engineer, Analytics Data Engineering

Dropbox Dropbox · Enterprise · Canada +2 · CTO-Data Science, AI Platform & Eng (Sub Team)

Data Engineer role focused on building large, scalable analytics pipelines from scratch using modern Big Data technologies like Spark and Databricks. Responsibilities include defining data assets, designing integrations and quality frameworks, and collaborating with business units and engineering teams on data platform architecture. Requires extensive experience with Spark, SQL, and data modeling.

What you'd actually do

  1. Help define company data assets (data model), Spark, SparkSQL jobs to populate data models
  2. Help define and design data integrations, data quality frameworks and design and evaluate open source/vendor tools for data lineage
  3. Work closely with Dropbox business units and engineering teams to develop strategy for long term Data Platform architecture to be efficient, reliable and scalable
  4. Conceptualize and own the data architecture for multiple large-scale projects, while evaluating design and operational cost-benefit tradeoffs within systems
  5. Collaborate with engineers, product managers, and data scientists to understand data needs, representing key data insights in a meaningful way

Skills

Required

  • Spark
  • Python
  • Java
  • C++
  • Scala
  • SQL
  • schema design
  • dimensional data modeling
  • medallion architectures
  • Databricks platform
  • data lake architectures
  • product strategic thinking
  • communications
  • data processing systems

Nice to have

  • Airflow
  • data quality monitoring
  • MonteCarlo