Senior Applied Scientist, Safe Locomotion, Compass

Amazon Amazon · Big Tech · Pasadena, CA · Applied Science

Senior Applied Scientist role focused on developing and deploying safe legged locomotion algorithms for robots using Reinforcement Learning (RL), sim-to-real transfer, and integrating learned policies with model-based control. The role involves training policies for dynamic gaits, ensuring safety constraints, and collaborating with other robotics teams.

What you'd actually do

  1. Design, train, and deploy reinforcement learning policies for dynamic legged locomotion including walking, running, stair climbing, and fall recovery on physical quadruped and humanoid platforms
  2. Collaborate with the Compass safety team to ensure locomotion policies operate within safety-critical bounds, incorporating control barrier functions or other formal safety mechanisms as constraints during or after training
  3. Develop sim-to-real transfer pipelines that produce policies robust to the reality gap, including domain randomization, system identification, and adaptive strategies
  4. Integrate learned locomotion policies with model-based whole-body controllers, defining how RL outputs (e.g., joint targets, contact schedules) interface with optimization-based control layers
  5. Formulate reward functions and training curricula that encode both performance objectives and safety constraints, ensuring policies respect stability and contact-force limits

Skills

Required

  • PhD, or Master's degree and 6+ years of applied research experience
  • 3+ years of experience applying RL to physical robotic systems (beyond simulation-only work)
  • Demonstrated expertise in sim-to-real transfer for locomotion or manipulation tasks
  • Strong understanding of legged robot dynamics, contact mechanics, and whole-body control fundamentals
  • Proficiency in Python and deep learning frameworks (e.g., PyTorch, JAX) with experience building custom RL training pipelines
  • Experience with physics simulators for robotics (e.g., Isaac Gym/Sim, MuJoCo, PyBullet)
  • Track record of publications at top-tier venues (e.g., RSS, CoRL, ICRA, NeurIPS, ICLR, IROS)

Nice to have

  • Experience deploying RL-trained locomotion policies on physical quadrupeds or humanoids
  • Familiarity with safety-constrained RL
  • Experience with model-based control (MPC, whole-body QP controllers, operational space control) and how learned policies compose with them
  • Knowledge of stability theory (Lyapunov methods, orbital stability) as it applies to periodic gaits
  • Experience with hierarchical RL, skill composition, or multi-task policy architectures for locomotion
  • Familiarity with real-time deployment constraints (latency budgets, onboard compute limitations, control-loop frequencies)
  • Experience building or contributing to large-scale RL training infrastructure (distributed training, GPU clusters)
  • Strong communication skills and ability to work across disciplinary boundaries (ML, controls, mechanical engineering)

What the JD emphasized

  • physical robotic systems
  • safety-critical context
  • safety constraints
  • safety
  • safety-critical bounds
  • safety mechanisms
  • safety-constrained RL
  • safety is never a blocker

Other signals

  • reinforcement learning
  • sim-to-real transfer
  • legged locomotion
  • safety-critical context
  • physical hardware deployment