Data Engineer Ii, Zappos Analytics

Amazon Amazon · Big Tech · NY +1 · Business Intelligence

Data Engineer role focused on designing, developing, and maintaining data infrastructure, including data pipelines, ETL processes, and data warehousing, to support analysis, reporting, and machine learning applications. The role involves collaborating with cross-functional teams and potentially redesigning data architecture. Experience with generative AI tools is preferred.

What you'd actually do

  1. Design, build, and maintain robust data pipelines to acquire, process, and store data from various sources such as databases, APIs, and external data providers.
  2. Develop and optimize ETL (Extract, Transform, Load) processes to clean, enrich, and structure raw data into a usable format for analysis and reporting.
  3. Implement and manage data warehousing solutions to ensure efficient data storage, retrieval, and query performance.
  4. Establish data quality standards, perform data validation, and proactively identify and address data quality issues.
  5. Optimize data pipelines and storage solutions to handle large volumes of data while maintaining high performance and reliability.

Skills

Required

  • data engineering experience
  • data modeling
  • data warehousing
  • ETL pipelines
  • distributed systems
  • data storage
  • data computing
  • data privacy
  • data security
  • data protection regulations
  • documentation

Nice to have

  • AWS technologies (Redshift, S3, AWS Glue, EMR, Kinesis, FireHose, Lambda, IAM)
  • non-relational databases
  • generative AI tools
  • prompting
  • evaluation practices
  • identifying generative AI opportunities