Data Engineer, Finauto, Accounts Receivable Data Engineering

Amazon Amazon · Big Tech · IN, TS, Hyderabad · Software Development

Data Engineer role focused on building an AI-First, Native-AI data platform for financial automation. The role involves creating AI-ready data products, modernizing architecture with technologies like Zero ETL and AWS DataZone, and ensuring data quality, governance, and observability for AI consumption. Requires expertise in AWS data services and data engineering fundamentals, with a strong interest in AI-driven analytics.

What you'd actually do

  1. Build and maintain scalable, reliable data pipelines and datasets on AWS (S3, EMR, Redshift, Glue) to support Finance analytics and reporting
  2. Develop and enhance AI-ready and analytics-ready data products, ensuring high quality, usability, and clear data definitions
  3. Implement robust ETL/ELT workflows and contribute to modern patterns such as Zero ETL, data mesh, and standardized ingestion frameworks
  4. Ensure end-to-end data quality, observability, and governance through validation, monitoring, lineage, and metadata management (e.g., AWS DataZone)
  5. Collaborate with cross-functional teams (analytics, finance, data science) to translate business needs into scalable, well-modeled data solutions

Skills

Required

  • 1+ years of data engineering experience
  • Experience with data modeling
  • Experience with warehousing
  • Experience building ETL pipelines
  • Experience with one or more query language (e.g., SQL, PL/SQL, DDL, MDX, HiveQL, SparkSQL, Scala)
  • Experience with one or more scripting language (e.g., Python, KornShell)
  • ETL/ELT
  • distributed processing
  • data modeling
  • orchestration
  • reporting systems
  • building scalable and reliable data pipelines handling massive datasets with high availability and performance requirements
  • dimensional modeling
  • data governance
  • lineage
  • observability
  • data quality frameworks
  • feature-ready datasets
  • vectorizable data structures
  • metadata management
  • semantic discoverability
  • reusable data products
  • Excellent problem-solving abilities
  • ownership mindset
  • simplifying complex data ecosystems
  • Strong written and verbal communication skills
  • partnering across engineering, finance, product, and senior leadership teams

Nice to have

  • Experience with big data technologies such as: Hadoop, Hive, Spark, EMR
  • Experience with any ETL tool like, Informatica, ODI, SSIS, BODI, Datastage, etc.
  • Zero ETL
  • AWS DataZone
  • end-to-end lineage
  • real-time observability
  • automated data quality frameworks
  • data mesh
  • data contracts

What the JD emphasized

  • AI-First
  • Native-AI
  • AI-ready
  • agentic applications
  • AI consumption
  • AI-compatible
  • AI-driven analytics
  • AI/ML data requirements

Other signals

  • AI-First, Native-AI platform
  • AI-ready, high-scale financial data products
  • power analytics, machine learning, generative AI, and agentic applications
  • trusted datasets optimized for AI consumption
  • AI-compatible data platforms
  • AI-driven analytics