AI Systems Engineer, Tooling & Infrastructure, Optimus

Tesla · Auto · Palo Alto, CA · Tesla AI

Software Engineer on the Optimus team, building tools and infrastructure for ML Platform automation, data and inference pipelines, and model evaluation for robotic intelligence.

What you'd actually do

  1. Automate data, inference, and auto-labeling pipelines
  2. Build the tooling and infrastructure for reporting and visualizing model metrics and performance
  3. Manage, analyze, and validate our training and test datasets
  4. Coordinate with the team managing the hardware cluster to maintain high availability and job throughput for machine learning
  5. Drive implementation of best practices and monitoring systems to proactively detect and address issues in our production environment

Skills

Required

  • Python
  • numpy
  • distributed compute systems (k8s, Slurm, LSF, etc.)
  • large datasets / data workflows
  • working in a team environment

Nice to have

  • training deep learning models
  • designing and deploying automation systems for machine learning workflows

What the JD emphasized

  • build the tools and infrastructure to make and measure improvements to neural network architecture by building and automating scalable data and inference pipelines
  • automates the collection, processing, and evaluation of data
  • build this automation platform that translates research breakthroughs into robotic intelligence at scale
  • continuous data integration, automated evaluation, and rapid deployment

Other signals

  • ML Platform
  • automation
  • data pipelines
  • inference pipelines
  • robotic intelligence