Data Engineer, Ois/cxi Analytics

Amazon Amazon · Big Tech · Austin, TX · Data Science

Data Engineer role focused on building and maintaining scalable data pipelines and ML-ready data infrastructure to support AI-driven operational insights and GenAI solutions within Amazon's Operations Technology ecosystem. The role involves ETL/ELT, feature engineering, and supporting Data Science and ML initiatives.

What you'd actually do

  1. Design, build, and maintain production-grade ETL/ELT pipelines and big data infrastructure supporting OTS operational intelligence.
  2. Build feature engineering workflows and ML-ready data pipelines that support Data Science experimentation and production model serving.
  3. Contribute to data governance and quality standards across analytical and ML data products.
  4. Support implementation of GenAI solutions for automated reporting, diagnostic, predictive, and prescriptive analytics.
  5. Build and maintain semantic layers and dashboard data models that power worldwide operations business decisions.

Skills

Required

  • 3+ years of data engineering experience
  • 3+ years of developing and operating large-scale data structures for business intelligence analytics using data modeling experience
  • Experience with data modeling, warehousing and building ETL pipelines
  • Experience with AWS technologies like Redshift, S3, AWS Glue, EMR, Kinesis, FireHose, Lambda, and IAM roles and permissions
  • Experience in data warehouse technical architectures, data modeling, infrastructure components, ETL/ ELT and reporting/analytic tools and environments, data structures and hands-on SQL coding
  • Bachelor's degree or above in computer science, machine learning, engineering, or related fields, or experience including, building and maintaining data flows and pipelines
  • Proficiency in Python and SQL
  • experience with PySpark or Apache Spark
  • Experience with infrastructure-as-code (CDK, CloudFormation) and CI/CD pipelines for data and ML systems
  • Experience with data modeling and relational/non-relational database design

Nice to have

  • Experience with non-relational databases / data stores (object storage, document or key-value stores, graph databases, column-family databases)
  • Master's degree or above in computer science, engineering, analytics, mathematics, statistics, IT or equivalent

What the JD emphasized

  • ML-ready data infrastructure
  • GenAI solutions
  • ML Engineers
  • Data Scientists
  • Applied Scientists
  • MLOps data practices
  • model retraining data support
  • data engineering
  • Data Science
  • AI
  • ML Engineers
  • Data Scientists
  • operational stakeholders
  • data engineering
  • ML systems

Other signals

  • build and maintain scalable data pipelines and ML-ready data infrastructure
  • support implementation of GenAI solutions
  • build feature engineering workflows