What you'd actually do

Design, develop and maintain scaled, automated, user-friendly systems, reports, dashboards, etc.

Partner with operations, business, economist, and ML teams to consult on, develop, and implement KPIs, automated reporting and process solutions, and data infrastructure improvements that meet business needs.

Build and maintain data infrastructure for AI agent systems, including vector databases, embedding pipelines, and retrieval-augmented generation (RAG) data stores.

Design data architectures that enable agentic workflows: structured data access layers, tool-use APIs, and context management systems that AI agents consume autonomously, as well as self-serve analytics.

Develop observability and evaluation pipelines for LLM-powered features, including tracking model performance, hallucination rates, latency, and cost metrics at scale.

Skills

Required

5+ years of data engineering experience
Experience with data modeling, warehousing and building ETL pipelines
Experience with SQL
Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS
Experience mentoring team members on best practices
Experience with AI/ML technologies

Nice to have

Experience with big data technologies such as: Hadoop, Hive, Spark, EMR
Experience operating large data warehouses

Other signals

building data infrastructure for AI/ML and LLM-based systems

designing pipelines that feed agentic workflows and retrieval-augmented generation (RAG) architectures


How often have you had an opportunity to be a member of a team that is tasked with solving customer needs through disruptive and innovative technology? Everyone on the team needs to be entrepreneurial, wear many hats and work in a fast-paced, ambiguous, and highly collaborative environment that’s more startup than big company. If this sounds intriguing, then we’d like to talk to you about a role on the Amazon Defect Elimination Analytics team. This team drives Amazon towards a defect-free customer experience by building technology that rapidly identifies defects, associates them with the information required to resolve the root cause, and prioritizes the multitude of improvement opportunities based on business and customer needs. To continue expanding our defect elimination program, we seek a passionate, results-oriented, Senior Data Engineer.

The Senior Data Engineer will partner with Software Developers, Research Scientists, Business Intelligence Engineers, and Program & Product Managers to provide insights on customer feedback, create key performance indicators for our products, and assist in feature engineering and model development. You will also play a key role in building the data infrastructure that powers AI/ML and LLM-based systems, including designing pipelines that feed agentic workflows and retrieval-augmented generation (RAG) architectures. The ideal candidate has strong business judgment, organization skills, backbone, and experience measuring product performance, and collaborates well with product owners to answer key questions. The operating environment is fast-paced and dynamic, but it has a strong team-oriented and welcoming culture. To thrive, you must be detail-oriented, enthusiastic, and flexible. In return, you will gain tremendous experience with the latest in big data technologies and generative AI infrastructure, and exposure to statistical and natural language modeling through collaboration with scientists on global issue detection models and development.

Key job responsibilities

Design, develop and maintain scaled, automated, user-friendly systems, reports, dashboards, etc.
Partner with operations, business, economist, and ML teams to consult on, develop, and implement KPIs, automated reporting and process solutions, and data infrastructure improvements that meet business needs.
Build and maintain data infrastructure for AI agent systems, including vector databases, embedding pipelines, and retrieval-augmented generation (RAG) data stores.
Design data architectures that enable agentic workflows: structured data access layers, tool-use APIs, and context management systems that AI agents consume autonomously, as well as self-serve analytics.
Develop observability and evaluation pipelines for LLM-powered features, including tracking model performance, hallucination rates, latency, and cost metrics at scale.
Apply analytical skills to extract meaningful insights and learnings from large, complex data sets, including unstructured text corpora used for generative AI applications.
Serve as liaison between business and technical teams to achieve project objectives, which requires data gathering, problem solving, modeling, and communication of insights and recommendations.
Stay current with advances in AI/ML data infrastructure (e.g., feature stores, vector search, streaming inference pipelines) and evaluate their applicability to defect elimination use cases.

A day in the life

If you are not sure that every qualification on the list above describes you exactly, we'd still love to hear from you! At Amazon, we value people with unique backgrounds, experiences, and skillsets. If you’re passionate about this role and want to make an impact on a global scale, please apply!

Basic Qualifications

5+ years of data engineering experience
Experience with data modeling, warehousing and building ETL pipelines
Experience with SQL
Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS
Experience mentoring team members on best practices
Experience with AI/ML technologies

Preferred Qualifications

Experience with big data technologies such as: Hadoop, Hive, Spark, EMR
Experience operating large data warehouses

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.

USA, TX, Austin - 154,600.00 - 209,100.00 USD annually

Senior Data Engineer, Amazon Customer Service
