What you'd actually do

Lead the design and implementation of shared, reusable data models, defining shared fact tables, conformed dimensions, and a semantic/metrics layer that serves as the single source of truth across analytics functions

Drive standardization of data engineering practices across ADE and functional analytics teams, including pipeline patterns, CI/CD workflows, naming conventions, and data modeling standards

Partner with Data Infrastructure to modernize orchestration, improve pipeline decomposition, and establish secure dev/test environments with production data access

Architect and implement a shift-left data governance strategy, working with upstream data producers to establish data contracts, SLOs, and code-enforced quality gates that catch issues before production

Collaborate with Data Science leads and Product Management to translate metric definitions into reliable, certified data pipelines that power executive dashboards, WBR reporting, and growth measurement

What the JD emphasized

12+ years of experience in data engineering or analytics engineering with increasing scope and technical leadership

12+ years of SQL experience, including complex analytical queries, window functions, and performance optimization at scale (Spark SQL)

8+ years of Python development experience, including building and maintaining production data pipelines

Deep expertise in dimensional data modeling, schema design, and scalable data architecture, with hands-on experience building shared data models across multiple business domains

Strong experience with orchestration tools (Airflow strongly preferred) and dbt, including pipeline design, scheduling strategies, and failure recovery patterns

Demonstrated ability to drive cross-team technical alignment, establishing standards, influencing without authority, and working across Data Engineering, Data Science, Data Infrastructure, and Product Engineering boundaries

Role Description

Dropbox is looking for a Staff Data Engineer to join our Analytics Data Engineering (ADE) team within Data Science & AI Platform. You will be responsible for solving cross-cutting data challenges that span multiple lines of business while driving standardization in how we build, deploy, and govern analytics pipelines across Dropbox.

This is not a maintenance role. We are modernizing our analytics platform, upgrading orchestration infrastructure, building shared and reusable data models with conformed dimensions, establishing a certified metrics framework, and laying the foundation for AI-native data development. You will partner closely with Data Science, Data Infrastructure, Product Engineering, and Business Intelligence teams to make this happen.

You will play a crucial role in establishing analytics engineering standards, designing scalable data models, and driving cross-functional alignment on data governance. You will get substantial exposure to senior leadership, shape the technical direction of analytics infrastructure at Dropbox, and directly influence how data powers product and business decisions.

Our Engineering Career Framework is viewable by anyone outside the company and describes what’s expected for our engineers at each of our career levels. Check out our blog post on this topic and more here.

Responsibilities

Lead the design and implementation of shared, reusable data models, defining shared fact tables, conformed dimensions, and a semantic/metrics layer that serves as the single source of truth across analytics functions
Drive standardization of data engineering practices across ADE and functional analytics teams, including pipeline patterns, CI/CD workflows, naming conventions, and data modeling standards
Partner with Data Infrastructure to modernize orchestration, improve pipeline decomposition, and establish secure dev/test environments with production data access
Architect and implement a shift-left data governance strategy, working with upstream data producers to establish data contracts, SLOs, and code-enforced quality gates that catch issues before production
Collaborate with Data Science leads and Product Management to translate metric definitions into reliable, certified data pipelines that power executive dashboards, WBR reporting, and growth measurement
Reduce operational burden by improving pipeline granularity, observability, and failure recovery, establishing runbooks and alerting standards that make on-call sustainable
Evaluate and integrate AI-native tooling into the data development lifecycle, enabling conversational data exploration with guardrails and AI-assisted pipeline development

Many teams at Dropbox run Services with on-call rotations, which entails being available for calls during both core and non-core business hours. If a team has an on-call rotation, all engineers on the team are expected to participate in the rotation as part of their employment. Applicants are encouraged to ask for more details of the rotations to which the applicant is applying.

Requirements

BS degree in Computer Science or related technical field, or equivalent technical experience
12+ years of experience in data engineering or analytics engineering with increasing scope and technical leadership
12+ years of SQL experience, including complex analytical queries, window functions, and performance optimization at scale (Spark SQL)
8+ years of Python development experience, including building and maintaining production data pipelines
Deep expertise in dimensional data modeling, schema design, and scalable data architecture, with hands-on experience building shared data models across multiple business domains
Strong experience with orchestration tools (Airflow strongly preferred) and dbt, including pipeline design, scheduling strategies, and failure recovery patterns
Demonstrated ability to drive cross-team technical alignment, establishing standards, influencing without authority, and working across Data Engineering, Data Science, Data Infrastructure, and Product Engineering boundaries

Preferred Qualifications

Experience with Databricks (Unity Catalog, Delta Lake) and modern lakehouse architectures
Experience leading orchestration or platform modernization efforts at scale
Familiarity with data governance and observability tools such as Atlan, Monte Carlo, Great Expectations, or similar
Experience building or contributing to a metrics/semantic layer (dbt MetricFlow, Databricks Metric Views, or equivalent)
Track record of establishing data engineering standards and best practices in a federated analytics organization

Compensation

US Zone 1

This role is not available in Zone 1

US Zone 2

$198,900—$269,100 USD

US Zone 3

$176,800—$239,200 USD

Role Description

Responsibilities

Drive standardization of data engineering practices across ADE and functional analytics teams, including pipeline patterns, CI/CD workflows, naming conventions, and data modeling standards

Partner with Data Infrastructure to modernize orchestration, improve pipeline decomposition, and establish secure dev/test environments with production data access

Reduce operational burden by improving pipeline granularity, observability, and failure recovery, establishing runbooks and alerting standards that make on-call sustainable

Evaluate and integrate AI-native tooling into the data development lifecycle, enabling conversational data exploration with guardrails and AI-assisted pipeline development

Requirements

BS degree in Computer Science or related technical field, or equivalent technical experience

12+ years of experience in data engineering or analytics engineering with increasing scope and technical leadership

12+ years of SQL experience, including complex analytical queries, window functions, and performance optimization at scale (Spark SQL)

8+ years of Python development experience, including building and maintaining production data pipelines

Deep expertise in dimensional data modeling, schema design, and scalable data architecture, with hands-on experience building shared data models across multiple business domains

Strong experience with orchestration tools (Airflow strongly preferred) and dbt, including pipeline design, scheduling strategies, and failure recovery patterns

Preferred Qualifications

Experience with Databricks (Unity Catalog, Delta Lake) and modern lakehouse architectures

Experience leading orchestration or platform modernization efforts at scale

Familiarity with data governance and observability tools such as Atlan, Monte Carlo, Great Expectations, or similar

Experience building or contributing to a metrics/semantic layer (dbt MetricFlow, Databricks Metric Views, or equivalent)

Track record of establishing data engineering standards and best practices in a federated analytics organization

Staff Data Engineer, Analytics Data Engineering

What you'd actually do

Skills

Required

Nice to have

What the JD emphasized

Other signals

Role Description

Responsibilities

Requirements

Preferred Qualifications

Compensation

Role Description

Responsibilities

Requirements

Preferred Qualifications

Compensation