Research Engineer, Machine Learning (rl Velocity)

Anthropic Anthropic · AI Frontier · San Francisco, CA · AI Research & Engineering

Research Engineer focused on building and improving the RL training infrastructure and tooling at Anthropic. The role involves identifying and removing bottlenecks in the RL stack, partnering with researchers and other engineering teams, and owning the reliability and performance of research runs to enable faster iteration and shipping of better models at scale.

What you'd actually do

  1. Build and improve the RL training infrastructure that researchers depend on day-to-day
  2. Identify and remove bottlenecks across the RL stack: debugging, profiling, and rearchitecting where needed
  3. Partner closely with researchers and with adjacent engineering teams (inference, sandboxing, and many more) to understand pain points and ship tooling that makes them faster
  4. Own the reliability and performance of research runs end-to-end
  5. Contribute to design decisions that shape how Anthropic does RL at scale

Skills

Required

  • strong software engineering fundamentals
  • building performant, reliable systems
  • ML infrastructure
  • distributed systems
  • research tooling
  • enabling other people's work
  • platforms
  • operating across the stack
  • low-level performance work
  • RL algorithms
  • shipping and iterating quickly
  • high agency
  • low ego

Nice to have

  • large-scale distributed training (RL, pre-training, or post-training)
  • JAX
  • PyTorch
  • operating at the edge of research and infra

What the JD emphasized

  • build performant, reliable systems
  • ML infrastructure
  • research tooling
  • shipping and iterating quickly
  • large-scale distributed training
  • fast-moving environment

Other signals

  • RL training infrastructure
  • research tooling
  • large-scale distributed training
  • performance and reliability of research runs