Data Engineer (python, Spark)

Autodesk Autodesk · Enterprise · Bangalore, India

Autodesk is seeking a Data Engineer to join their Access domain, focusing on building and maintaining data engineering pipelines using platforms like Hive, Spark, Flink, and cloud services on AWS. The role involves designing, building, and maintaining scalable data pipelines and models, modernizing legacy workflows, developing ETL/ELT processes, and enabling analytics. The candidate will act as a Data Champion, driving data quality, reliability, and observability standards.

What you'd actually do

  1. Design, build, and maintain scalable data pipelines and data models across Access
  2. Modernize legacy data workflows and infrastructure, including migrations from platforms such as Hive to Iceberg
  3. Develop reliable ETL/ELT workflows to ingest, transform, and serve data for analytics and operational use cases
  4. Own critical data pipelines end-to-end and contribute to improving the overall data platform
  5. Enable analytics and provide critical insights around product usage, campaign performance, funnel metrics, segmentation, conversion, and revenue growth

Skills

Required

  • SQL
  • Python
  • Spark
  • Airflow
  • Snowflake
  • AWS
  • Looker
  • Power BI
  • Git
  • Jenkins CI
  • Flink
  • data modeling
  • pipeline reliability
  • large-scale data processing
  • Jupyter
  • EMR Notebooks
  • Apache Zeppelin
  • AI-assisted development tools

Nice to have

  • building or using AI-powered tools, agents, or automation in data workflows
  • Model Context Protocol (MCP)
  • data platform enhancements, optimizations, or small-scale migrations
  • data observability and monitoring practices