Principal ML Engineer, Machine Learning Platform and Systems Architecture

Autodesk Autodesk · Enterprise · Boston, MA +20 · Remote

Autodesk is seeking a Principal ML Engineer to lead the design and evolution of large-scale machine learning platforms, focusing on ML infrastructure, data systems, model lifecycle tooling, and production architecture. This role involves owning high-impact technical initiatives, defining technical direction for ML systems, and driving execution across cross-functional teams. The ideal candidate has strong system architecture, distributed computing, and end-to-end platform thinking skills.

What you'd actually do

  1. Lead architecture and delivery for major ML platform capabilities across training, evaluation, deployment, and observability
  2. Design scalable systems for distributed training, data processing, feature and model lifecycle management, and production inference
  3. Own platform-level technical outcomes from design through deployment, operations, and continuous improvement
  4. Drive the design and scaling of data pipelines for large-scale structured and semi-structured technical datasets
  5. Lead architecture for distributed data processing and orchestration systems such as Ray, Airflow, Spark, or similar platforms

Skills

Required

  • Python
  • software engineering practices
  • large-scale data pipelines
  • distributed data processing
  • cloud-native platform architectures
  • model deployment
  • inference systems
  • production observability
  • architecture decisions
  • performance
  • scalability
  • reliability
  • cost
  • communication
  • stakeholder management

Nice to have

  • data governance
  • lineage
  • provenance
  • geometry
  • graph
  • hierarchical
  • multimodal data
  • distributed ML frameworks
  • large-scale training infrastructure
  • Kubernetes
  • workflow orchestration systems
  • modern ML platform tooling
  • incident leadership
  • service reviews
  • resiliency practices
  • operational readiness
  • AEC data
  • computational design workflows
  • BIM/CAD ecosystems
  • Autodesk products

What the JD emphasized

  • production software engineering practices
  • large-scale data pipelines
  • distributed data processing
  • cloud-native platform architectures
  • model deployment
  • inference systems
  • production observability

Other signals

  • ML platform
  • systems architecture
  • distributed systems
  • production inference
  • model lifecycle tooling