Senior Engineering Manager, Reinforcement Learning Environments (rle)

Handshake · Enterprise · San Francisco, CA · Engineering

Senior Engineering Manager to lead the Reinforcement Learning Environments (RLE) team, responsible for building interactive sandboxes that simulate end-to-end workflows for frontier models. The team generates high-signal interaction data used for training and evaluating models on task completion, quality, and robustness. The role involves leading a team of engineers, owning the RLE roadmap, driving architecture for scalable systems, and ensuring reliability and data quality.

What you'd actually do

  1. Lead, hire, and develop a high-performing team building RL environments and the platform behind them
  2. Own the RLE roadmap and execution in close partnership with Research, Product, and Operations
  3. Drive architecture for scalable, reliable, extensible environment systems and data generation pipelines
  4. Build modular, plug-and-play domains that integrate cleanly with training and evaluation loops
  5. Raise the bar on reliability, observability, performance, and data quality

Skills

Required

  • 3+ years managing teams
  • 5+ years hands-on engineering experience
  • Experience leading senior engineers
  • proven ability to align cross-functionally and deliver in fast-moving, unclear problem spaces
  • strong platform/distributed systems background
  • ability to turn research/ops needs into a clear roadmap, ship iteratively, and measure outcomes

Nice to have

  • Experience with RL training infrastructure, simulation systems, or evaluation platforms
  • Human-in-the-loop systems (annotation, rubric tooling, QA pipelines, workflow platforms)
  • Operations-heavy, tech-enabled environment experience
  • Familiarity with AWS/GCP, APIs, Docker, and modern stacks (TypeScript/Node, React)
  • Experience building systems used by applied ML or AI research teams
  • managing an EM (or equivalent scope)

What the JD emphasized

  • 5 days/week (no remote/hybrid)
  • Execution in ambiguity
  • strong engineering fundamentals

Other signals

  • leading RL environments team
  • building interactive sandboxes for frontier models
  • generating high-signal interaction data for training and evaluation