Machine Learning Systems Engineer, RL Engineering

Anthropic Anthropic · AI Frontier · AI Research & Engineering

This role focuses on building, maintaining, and improving the critical algorithms and infrastructure for training AI models, specifically using RLHF and other advanced techniques. The goal is to enhance the performance, robustness, speed, reliability, and usability of these training systems to enable breakthroughs in AI capabilities and safety.

What you'd actually do

  1. Build, maintain, and improve the algorithms and systems that researchers use to train models.
  2. Improve the speed, reliability, and ease-of-use of these systems.
  3. Support and empower the research team in the mission to build beneficial AI systems.
  4. Implement and improve advanced techniques to create ever more capable, reliable and steerable AI.
  5. Focus obsessively on improving the performance, robustness, and usability of these systems so our research can progress as quickly as possible.

Skills

Required

  • 4+ years of software engineering experience
  • Python
  • Implementing LLM finetuning algorithms, such as RLHF

Nice to have

  • High performance, large scale distributed systems
  • Large scale LLM training

What the JD emphasized

  • 4+ years of software engineering experience
  • Implementing LLM finetuning algorithms, such as RLHF

Other signals

  • building systems that train AI models
  • implementing and improving advanced techniques
  • critical algorithms and infrastructure
  • enabling breakthroughs in AI capabilities and safety
  • improving the performance, robustness, and usability of these systems
  • improving the speed, reliability, and ease-of-use of these systems
  • implementing LLM finetuning algorithms, such as RLHF