Research Engineer, Production Model Post-training

Anthropic Anthropic · AI Frontier · Zürich, Switzerland · AI Research & Engineering

Research Engineer focused on post-training of production LLMs, implementing and optimizing techniques like Constitutional AI and RLHF to enhance model capabilities, alignment, and safety. Involves research, pipeline development, evaluation, and debugging at scale.

What you'd actually do

  1. Implement and optimize post-training techniques at scale on frontier models
  2. Conduct research to develop and optimize post-training recipes that directly improve production model quality
  3. Design, build, and run robust, efficient pipelines for model fine-tuning and evaluation
  4. Develop tools to measure and improve model performance across various dimensions
  5. Collaborate with research teams to translate emerging techniques into production-ready implementations

Skills

Required

  • Python
  • deep learning frameworks
  • distributed computing
  • building complex ML systems
  • analyzing and debugging model training processes

Nice to have

  • LLMs
  • AI safety
  • responsible deployment

What the JD emphasized

  • Python
  • large-scale distributed systems
  • high-performance computing
  • training, fine-tuning, or evaluating large language models
  • analyzing and debugging model training processes

Other signals

  • train production models
  • post-training techniques
  • alignment methodologies
  • large-scale distributed systems
  • high-performance computing
  • large language models