Phd Autonomy Engineer Intern - Planning & Controls (reinforcement Learning)

Skydio · Defense · Zurich, Switzerland · R&D

PhD intern role focused on developing and deploying reinforcement learning policies for autonomous drone planning, navigation, and control in complex environments. Involves research in sim-to-real transfer, human-in-the-loop control, and multi-agent coordination, with a strong emphasis on real-world flight testing.

What you'd actually do

  1. Train policies that adapt online to cluttered 3D scenes (forests, bridges, urban canyons), complementing our geometric stack for robust obstacle avoidance and dynamic goal-seeking.
  2. Fuse learned cost shaping / value functions with trajectory optimization for smooth, agile flight with tight safety envelopes and mission constraints.
  3. Build scalable datasets and training loops with Isaac Lab, domain randomization, residual learning, and safety filters; validate on real drones weekly.
  4. Learn assistive policies that blend pilot intent, autonomy priors, and uncertainty-aware behaviors for intuitive control handoffs.
  5. Explore decentralized coordination for coverage, pursuit, and collaborative mapping with minimal comms.

Skills

Required

  • PhD student in Robotics, Machine Learning, Controls, or related field
  • Strong fundamentals in RL, control theory, and motion planning
  • Comfort with safety/robustness concepts
  • Proficient in Python (PyTorch/JAX/Ray RLlib)
  • Proficient in at least one of C++ or CUDA
  • Hands-on experience with robotics simulation (Isaac Lab/MuJoCo/PyBullet)
  • Hands-on experience with sim2real techniques
  • Experience training/deploying policies for navigation, manipulation, or locomotion on real robots or autonomous vehicles

Nice to have

  • Publications (CoRL, ICRA, IROS, RSS, NeurIPS)
  • Experience with onboard inference optimization (TensorRT, quantization, sparsity)
  • Familiarity with modern policy learning beyond vanilla RL: diffusion policies, IL/BC, offline RL, model-based RL
  • Experience with multi-agent RL or distributed training

What the JD emphasized

  • reinforcement learning
  • plan, navigate, and control
  • robotics simulation
  • sim2real

Other signals

  • reinforcement learning
  • autonomous flight
  • robotics simulation
  • sim2real