Research Engineer, Reward Models

Anthropic Anthropic · AI Frontier · AI Research & Engineering

Research Engineer focused on developing and implementing novel reward modeling architectures and techniques to align AI systems with human values and advance AI capabilities. The role involves optimizing training and data pipelines, collaborating on integrating advances into production systems, and communicating research progress.

What you'd actually do

  1. Help implement novel reward modeling architectures and techniques
  2. Optimize training pipelines
  3. Build and optimize data pipelines
  4. Collaborate across teams to integrate reward modeling advances into production systems
  5. Communicate engineering progress through internal documentation and potential publications

Skills

Required

  • Python
  • deep learning frameworks
  • distributed computing
  • modern LLM architectures
  • alignment techniques
  • model training pipelines
  • data pipelines
  • AI alignment
  • AI safety

Nice to have

  • reward models
  • LLMs
  • large models

What the JD emphasized

  • strong engineering background in machine learning
  • demonstrable expertise in preference learning, reinforcement learning, deep learning, or related areas
  • proficient in Python, deep learning frameworks, and distributed computing
  • familiar with modern LLM architectures and alignment techniques
  • experience with improving model training pipelines and building data pipelines
  • comfortable with the experimental nature of frontier AI research
  • view research and engineering as complementary disciplines and are willing to implement some research ideas
  • can clearly communicate complex technical concepts and research findings
  • have a deep interest in AI alignment and safety
  • Proficiency in Python and experience with deep learning frameworks is required for this role

Other signals

  • reward models
  • AI alignment
  • human values
  • pushing AI capabilities
  • training pipelines
  • data pipelines