AI Infrastructure Engineer, Model Optimization & Deployment, Optimus

Tesla · Auto · Palo Alto, CA · Tesla AI

This role focuses on optimizing and deploying ML models for Tesla's Optimus humanoid robots. The engineer will optimize models for latency, memory usage, and inference speed; quantize, prune, and convert them to deployment formats; benchmark, package, and deploy them as services; and build CI/CD pipelines for ML models, ensuring scalability and reliability in production and ultimately shipping models to thousands of robots.

What you'd actually do

  1. Optimize ML models for latency, memory usage, and inference speed
  2. Quantize, prune, and convert models (e.g., to ONNX, TensorRT, TFLite) for deployment on various platforms (cloud, edge, mobile)
  3. Benchmark and profile model performance across different environments
  4. Package and deploy models as REST APIs, batch jobs, or streaming services using tools like FastAPI, Flask, or gRPC
  5. Implement CI/CD pipelines for automated testing and deployment of ML models

Skills

Required

  • Python
  • PyTorch
  • model optimization tools (e.g., ONNX, TensorRT, TFLite, TVM)
  • model inference optimization and quantization
  • containerization and orchestration (Docker, Kubernetes)
  • cloud platforms (AWS, GCP, Azure)
  • serverless deployments
  • software engineering principles
  • CI/CD pipelines
  • deploying models to edge devices or mobile platforms
  • data serialization formats (e.g., protobuf, Avro)

Nice to have

  • FastAPI
  • Flask
  • gRPC
  • Prometheus
  • Grafana

What the JD emphasized

  • real-time latency constraints
  • models shipped to and used by thousands of humanoid robots in real-world applications

Other signals

  • Deploying models to edge devices
  • Optimizing ML models for latency, memory usage, and inference speed
  • Automating the entire workflow of training, validation, and production deployment
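The automation signal above typically means a CI pipeline that tests, exports, and publishes the model on every change. A hypothetical sketch as a GitHub Actions workflow; the repository layout, script names, and artifact name are all assumptions:

```yaml
# .github/workflows/model-ci.yml (hypothetical)
name: model-ci
on: [push]
jobs:
  test-and-export:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.11"
      - run: pip install -r requirements.txt
      - run: pytest tests/          # validate model and serving code
      - run: python export_onnx.py  # export + quantize the model
      - uses: actions/upload-artifact@v4
        with:
          name: model-onnx
          path: model.onnx
```

A production pipeline would add stages this sketch omits: accuracy regression checks against a validation set, latency gates per target device, and a staged rollout to the robot fleet.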