What you'd actually do

Build and maintain scalable, reliable data pipelines and datasets on AWS (S3, EMR, Redshift, Glue) to support Finance analytics and reporting

Develop and enhance AI-ready and analytics-ready data products, ensuring high quality, usability, and clear data definitions

Implement robust ETL/ELT workflows and contribute to modern patterns such as Zero ETL, data mesh, and standardized ingestion frameworks

Ensure end-to-end data quality, observability, and governance through validation, monitoring, lineage, and metadata management (e.g., AWS DataZone)

Collaborate with cross-functional teams (analytics, finance, data science) to translate business needs into scalable, well-modeled data solutions

Skills

Required

1+ years of data engineering experience
Experience with data modeling
Experience with warehousing
Experience building ETL pipelines
Experience with one or more query language (e.g., SQL, PL/SQL, DDL, MDX, HiveQL, SparkSQL, Scala)
Experience with one or more scripting language (e.g., Python, KornShell)
ETL/ELT
distributed processing
data modeling
orchestration
reporting systems
building scalable and reliable data pipelines handling massive datasets with high availability and performance requirements
dimensional modeling
data governance
lineage
observability
data quality frameworks
feature-ready datasets
vectorizable data structures
metadata management
semantic discoverability
reusable data products
Excellent problem-solving abilities
ownership mindset
simplifying complex data ecosystems
Strong written and verbal communication skills
partnering across engineering, finance, product, and senior leadership teams

Nice to have

Experience with big data technologies such as: Hadoop, Hive, Spark, EMR
Experience with any ETL tool like, Informatica, ODI, SSIS, BODI, Datastage, etc.
Zero ETL
AWS DataZone
end-to-end lineage
real-time observability
automated data quality frameworks
data mesh
data contracts

After building a proven Data Mesh architecture across thousands of users, we are re-imagining our data ecosystem as an AI-First, Native-AI platform that powers the next generation of intelligent financial automation. FinAuto team at Amazon is looking for a Data Engineer to play a critical role in building AI-ready, high-scale financial data products that enable autonomous insights, intelligent workflows, and data-driven decision making at enterprise scale. This is a unique opportunity to help shape a Native-AI data foundation by building highly discoverable, governed, and reusable data products that power analytics, machine learning, generative AI, and agentic applications across Amazon Finance. You will help modernize our architecture using technologies such as Zero ETL, AWS DataZone, end-to-end lineage, real-time observability, and automated data quality frameworks to create trusted datasets optimized for AI consumption.

The ideal candidate is passionate about designing next-generation distributed data systems on AWS and believes in democratizing access to high-quality data to accelerate innovation. You will work on building scalable, secure, and AI-compatible data platforms that enable self-service analytics, semantic data discovery, and intelligent financial operations. We are looking for candidates with strong expertise in AWS technologies such as S3, EMR, Redshift, Glue, and Lake Formation, or deep experience in traditional BI and Data Warehousing systems, combined with a strong interest in AI-driven analytics, data mining, and pattern discovery from large-scale datasets. Experience designing and managing data products, metadata-driven architectures, and data contracts will be highly valued.

The ideal candidate should possess:

Strong expertise in core data engineering fundamentals including ETL/ELT, distributed processing, data modeling, orchestration, and reporting systems
Experience building scalable and reliable data pipelines handling massive datasets with high availability and performance requirements
Good understanding of dimensional modeling, data governance, lineage, observability, and data quality frameworks
Familiarity with AI/ML data requirements including feature-ready datasets, vectorizable data structures, metadata management, and semantic discoverability
Ability to think in terms of reusable data products instead of siloed pipelines
Excellent problem-solving abilities, ownership mindset, and a passion for simplifying complex data ecosystems
Strong written and verbal communication skills with experience partnering across engineering, finance, product, and senior leadership teams

Join our exceptional team where you'll solve complex data challenges while working alongside industry-leading engineers building Amazon’s AI-First finance data ecosystem. You will have the opportunity to influence the future of enterprise-scale data products, contribute to Native-AI platform modernization, and build systems that power intelligent automation for one of the world’s largest FinOps organizations.

Key job responsibilities

Build and maintain scalable, reliable data pipelines and datasets on AWS (S3, EMR, Redshift, Glue) to support Finance analytics and reporting
Develop and enhance AI-ready and analytics-ready data products, ensuring high quality, usability, and clear data definitions
Implement robust ETL/ELT workflows and contribute to modern patterns such as Zero ETL, data mesh, and standardized ingestion frameworks
Ensure end-to-end data quality, observability, and governance through validation, monitoring, lineage, and metadata management (e.g., AWS DataZone)
Collaborate with cross-functional teams (analytics, finance, data science) to translate business needs into scalable, well-modeled data solutions

About the team The AR Data Engineering (ARDE) team develops and maintains robust data solutions that power Global Accounts Receivable operations. We build comprehensive datasets for collections, cash management, customer contacts, and billing processes that drive critical business insights. Our scalable platform transforms how AR teams operate by providing near-time visibility into cash flow and collections, optimizing working capital management, enhancing customer experience, and enabling secure, efficient data access and analysis. Through our sophisticated data infrastructure, we ensure AR teams can easily discover, access, and analyze the right information at the right time, empowering better decision-making across the organization."

Basic Qualifications

1+ years of data engineering experience
Experience with data modeling, warehousing and building ETL pipelines
Experience with one or more query language (e.g., SQL, PL/SQL, DDL, MDX, HiveQL, SparkSQL, Scala)
Experience with one or more scripting language (e.g., Python, KornShell)

Preferred Qualifications

Experience with big data technologies such as: Hadoop, Hive, Spark, EMR
Experience with any ETL tool like, Informatica, ODI, SSIS, BODI, Datastage, etc.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

The ideal candidate should possess:

Strong expertise in core data engineering fundamentals including ETL/ELT, distributed processing, data modeling, orchestration, and reporting systems
Experience building scalable and reliable data pipelines handling massive datasets with high availability and performance requirements
Good understanding of dimensional modeling, data governance, lineage, observability, and data quality frameworks
Familiarity with AI/ML data requirements including feature-ready datasets, vectorizable data structures, metadata management, and semantic discoverability
Ability to think in terms of reusable data products instead of siloed pipelines
Excellent problem-solving abilities, ownership mindset, and a passion for simplifying complex data ecosystems
Strong written and verbal communication skills with experience partnering across engineering, finance, product, and senior leadership teams

Key job responsibilities

Build and maintain scalable, reliable data pipelines and datasets on AWS (S3, EMR, Redshift, Glue) to support Finance analytics and reporting
Develop and enhance AI-ready and analytics-ready data products, ensuring high quality, usability, and clear data definitions
Implement robust ETL/ELT workflows and contribute to modern patterns such as Zero ETL, data mesh, and standardized ingestion frameworks
Ensure end-to-end data quality, observability, and governance through validation, monitoring, lineage, and metadata management (e.g., AWS DataZone)
Collaborate with cross-functional teams (analytics, finance, data science) to translate business needs into scalable, well-modeled data solutions

Basic Qualifications

1+ years of data engineering experience
Experience with data modeling, warehousing and building ETL pipelines
Experience with one or more query language (e.g., SQL, PL/SQL, DDL, MDX, HiveQL, SparkSQL, Scala)
Experience with one or more scripting language (e.g., Python, KornShell)

Preferred Qualifications

Experience with big data technologies such as: Hadoop, Hive, Spark, EMR
Experience with any ETL tool like, Informatica, ODI, SSIS, BODI, Datastage, etc.

Data Engineer, Finauto, Accounts Receivable Data Engineering

What you'd actually do

Skills

Required

Nice to have

What the JD emphasized

Other signals

Basic Qualifications

Preferred Qualifications

Basic Qualifications

Preferred Qualifications