Principal Machine Learning Engineer

Autodesk Autodesk · Enterprise · Toronto, ON +1

Principal Machine Learning Engineer at Autodesk, focusing on integrating AI into AEC products. The role involves exploring and analyzing complex AEC datasets (BIM, drawings, point clouds), building and curating datasets for GenAI, prototyping workflows, and collaborating with AI researchers and product teams. The primary focus is on data preparation and understanding for AI applications within the AEC domain, with secondary involvement in model development and evaluation.

What you'd actually do

  1. Explore and make sense of AEC data at scale: Dive into complex design and construction datasets (e.g. BIM models, drawings, geometry, point clouds, metadata) to uncover patterns, anomalies, and opportunities, translating raw data into meaningful insights and narratives
  2. Tell compelling data-driven stories: Synthesize findings into clear, impactful visualizations, prototypes, and narratives that influence product direction, research investments, and AI strategy
  3. Build and curate high-quality datasets for ML/GenAI: Design data pipelines and workflows to extract, clean, structure, and label large-scale AEC datasets (geometry, text, images, point clouds, embeddings) for downstream machine learning applications
  4. Collaborate across disciplines to explore ambiguous problems: Partner with ML engineers, researchers, product managers, and designers to define open-ended questions, frame experiments, and iterate toward meaningful solutions
  5. Design and implement scalable data and ML pipelines: Architect and develop robust pipelines for processing and analyzing large datasets, ensuring reproducibility, scalability, and efficiency

Skills

Required

  • Python
  • TypeScript
  • Data visualization
  • Data storytelling
  • Data pipelines
  • Machine learning
  • Data analysis
  • Cloud platforms (AWS, Azure, or GCP)
  • Scalable data processing
  • Software engineering fundamentals
  • Agile environments
  • AEC domain knowledge

Nice to have

  • Experience with AEC data formats and workflows (e.g., BIM, IFC, CAD)
  • Experience with Revit, AutoCAD, or Autodesk Forma
  • Experience with infrastructure or reality capture data, including point clouds and LiDAR (e.g., using tools like Autodesk ReCap)

What the JD emphasized

  • complex design and construction datasets
  • high-quality datasets for ML/GenAI
  • ambiguous problem spaces
  • AEC data at scale

Other signals

  • Exploratory analysis of AEC data
  • Prototype new workflows
  • Build and curate high-quality datasets
  • Collaborate with AI researchers and ML engineers
  • Influence product direction and AI capabilities