Senior Data Engineer - Data Engineering

Plaid · Fintech · San Francisco, CA · Engineering

Plaid is seeking a Senior Data Engineer to build and scale data systems, focusing on data quality, performance, and enabling data-driven decisions across the company. The role involves owning core SQL and Python data pipelines, working with tools like DBT, Airflow, and Redshift, and collaborating with various teams to define Plaid's data strategy.

What you'd actually do

  1. Understanding different aspects of the Plaid product and strategy to inform golden dataset choices, design and data usage principles.
  2. Have data quality and performance top of mind while designing datasetsLeading key data engineering projects that drive collaboration across the company.
  3. Advocating for adopting industry tools and practices at the right timeOwning core SQL and python data pipelines that power our data lake and data warehouse.
  4. Well-documented data with defined dataset quality, uptime, and usefulness.

Skills

Required

  • SQL
  • Python
  • DBT
  • Airflow
  • Redshift
  • Spark
  • Kafka
  • schema design
  • data privacy
  • data integrity

Nice to have

  • Atlan
  • Retool
  • Snowflake
  • Databricks
  • Mode

What the JD emphasized

  • 4+ years of dedicated data engineering experience
  • building data models and data pipelines on top of large datasets (in the order of 500TB to petabytes)
  • experience building and maintaining batch and realtime pipelines