Senior Data Developer, Analytics & Insights

Autodesk Autodesk · Enterprise · Toronto, ON +5 · Remote

Autodesk is seeking a Senior Data Developer to join their Platform Strategy & Emerging Technologies (PSET) organization. The role focuses on developing and enhancing scalable data pipelines using big data technologies, with an emphasis on an AI-native data engineering mindset. Responsibilities include designing, developing, and maintaining ELT/ETL pipelines, ensuring data quality and governance, and collaborating with various stakeholders. The position requires strong Python and SQL skills, experience with cloud platforms (AWS), and data processing frameworks like Spark. Experience with AI-native data engineering technologies, structuring data for AI reasoning, and optimizing agentic workflows is preferred.

What you'd actually do

  1. Design, develop, automate and maintain scalable, robust and reliable ELT/ETL data pipelines that collect, process and transform large volumes of structure and unstructured data from various sources
  2. Maintain and enhance our existing data architecture to ensure smooth and efficient data flow across platforms
  3. Interface with data peers, product managers and cross-functional stakeholders to gather requirements, sequence work, and document technical solutions
  4. Implement best practices for data quality, integrity and governance, including monitoring, validation and auditing processes to ensure reliable and consistent data availability
  5. Contribute to a team culture that values quality, robustness, and scalability while fostering initiatives and innovation by staying up to date with industry trends and new technologies

Skills

Required

  • 5+ years of data processing and data engineering experience
  • Python
  • SQL
  • dimensional modeling
  • analytical data warehouses (Snowflake, Presto/Hive)
  • Data Engineering best practices for medium to large scale production workloads
  • big data processing frameworks (Spark, Hadoop)
  • data pipeline orchestration tools (Airflow)
  • processing semi-structured file formats (Json, parquet)
  • communication skills
  • problem solver
  • Bachelor’s degree in computer science, data science, or related fields

Nice to have

  • Jinja
  • Shell scripting
  • DBT
  • Spark SQL
  • serverless technologies (AWS glue, lambda, EMR, EKS)
  • remote development using AWS SDK
  • ETL and ELT pipelines
  • traditional ETL tools (Airflow, Talend, Informatica)
  • modern ELT frameworks (dbt, Snowflake)
  • AWS IAM roles, permissions
  • Terraform for AWS resource provisioning
  • AWS networking (VPC, security groups, cross-account permissions)
  • REST API design and implementation
  • containers
  • infrastructure-as-code principles
  • automation frameworks (Git, Jenkins, Terraform)
  • Master’s degree in computer science, data science, or related fields
  • AI-native data engineering technologies
  • MCP (Model, Context, Prompt) related context engineering
  • structure data for enhanced AI reasoning
  • reducing latency for complex agentic workflows

What the JD emphasized

  • AI-native data engineering mindset
  • ML-driven validation
  • anomaly detection
  • automated schema evolution
  • automated data lineage
  • optimizations for pipeline performance
  • resource efficiency
  • structure data for enhanced AI reasoning
  • reducing latency for complex agentic workflows

Other signals

  • AI-native data engineering mindset
  • ML-driven validation
  • anomaly detection
  • automated schema evolution
  • automated data lineage
  • optimizations for pipeline performance
  • resource efficiency
  • structure data for enhanced AI reasoning
  • reducing latency for complex agentic workflows