Sr. Data Engineer, Ww Standardization & Automation

Amazon Amazon · Big Tech · Bellevue, WA · Business Intelligence

Seeking a Senior Data Engineer to establish a reliable and accessible data platform, ensuring Operations Finance customers have trust in the data, technology, and tools to make data-driven business decisions. The role involves designing, implementing, and supporting scalable data infrastructure solutions and complex data models, integrating with various data sources, and curating data for reporting, analysis, and machine learning models. Responsibilities include building data pipelines and data-sets using Python, Airflow, and SQL, automating deployments with AWS tools, and working with AWS services like Redshift, Glue, and S3.

What you'd actually do

  1. Interface with other technology teams to extract, transform, and load data from a wide variety of data sources using Python, SQL and AWS big data technologies.
  2. Explore and learn the latest AWS technologies to provide new capabilities and increase efficiencies of the team.
  3. Designing and implementing complex pipelines and other Data Engineering solutions.
  4. Work closely with business owners, developers, Business Intelligence Engineer to explore new data sources and deliver the data.
  5. Create extensible designs and easy to maintain solutions with the long term vision in mind

Skills

Required

  • data engineering
  • data modeling
  • warehousing
  • ETL pipelines
  • SQL
  • Python
  • AWS Redshift
  • AWS S3
  • AWS Glue
  • AWS EMR
  • AWS Kinesis
  • AWS FireHose
  • AWS Lambda
  • AWS IAM roles and permissions
  • mentoring

Nice to have

  • Java
  • Scala
  • NodeJS
  • highly available, distributed systems
  • MPP databases
  • distributed systems
  • data storage
  • computing
  • orchestration frameworks
  • AWS step functions
  • Airflow

What the JD emphasized

  • 7+ years of data engineering experience
  • Experience with data modeling, warehousing and building ETL pipelines
  • Experience with SQL
  • Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS
  • Experience mentoring team members on best practices
  • Experience with AWS technologies like Redshift, S3, AWS Glue, EMR, Kinesis, FireHose, Lambda, and IAM roles and permissions
  • Experience building/operating highly available, distributed systems of data extraction, ingestion, and processing of large data sets
  • Experience with MPP databases such as Amazon Redshift
  • Knowledge of distributed systems as it pertains to data storage and computing
  • Deep knowledge of orchestration frameworks like step functions / Airflow.