AI Training Infrastructure Engineer – Humanoid Whole Body Control

Figure AI Figure AI · Robotics · HQ · Controls

Figure AI is seeking an AI Training Infrastructure Engineer to own and scale the training and deployment backbone for their RL-based whole-body control systems for humanoid robots. This role involves building and optimizing infrastructure for simulation, data pipelines, orchestration, and visualization, with a focus on accelerating iteration cycles and enabling the deployment of new capabilities to their robot fleet. The position requires strong software engineering skills in Python and PyTorch, experience with robotics/ML training infrastructure, familiarity with physics simulators, and knowledge of reinforcement learning.

What you'd actually do

  1. Own and scale the infrastructure used to train whole-body control policies (simulation, data pipelines, orchestration, visualizations)
  2. Design systems that are fast, reliable, and highly configurable for our controls engineers
  3. Ensure high cluster utilization and minimal downtime—unblocking the team and accelerating iteration cycles
  4. Evaluate and integrate physics engines, simulation environments, and parameterizations to balance realism and training speed
  5. Optimize hyperparameters and infrastructure to maximize training speed and efficiency and final model performance

Skills

Required

  • Strong software engineering fundamentals
  • production experience in Python and PyTorch
  • Experience building or scaling training infrastructure for robotics, control systems, or large-scale ML workloads
  • Familiarity with physics simulation tools such as NVIDIA PhysX, MuJoCo, Warp, or PyBullet
  • Working knowledge of dynamics, controls, and robotics systems
  • Experience with reinforcement learning, imitation learning, or policy distillation
  • Strong ownership mindset
  • Experience modeling contact interactions and photorealistic simulation environments for complex manipulation tasks

Nice to have

  • Experience with humanoid or legged robot control
  • Background in distributed systems, job schedulers, or cluster management
  • Experience deploying ML models or control policies to real-world systems

What the JD emphasized

  • production experience
  • scaling training infrastructure
  • reinforcement learning
  • ownership mindset

Other signals

  • training infrastructure
  • RL-based whole-body control
  • simulation
  • data pipelines
  • orchestration
  • deployment backbone