Research Engineer / Research Scientist, Pre-training

Anthropic Anthropic · AI Frontier · Zürich, Switzerland · AI Research & Engineering

Research Engineer/Scientist focused on pre-training large language models, with an emphasis on multimodal capabilities. The role involves research, implementation, experimentation, and optimization of training infrastructure and model architectures, contributing to the development of safe and steerable AI systems.

What you'd actually do

  1. Conduct research and implement solutions in areas such as model architecture, algorithms, data processing, and optimizer development
  2. Independently lead small research projects while collaborating with team members on larger initiatives
  3. Design, run, and analyze scientific experiments to advance our understanding of large language models
  4. Optimize and scale our training infrastructure to improve efficiency and reliability
  5. Develop and improve dev tooling to enhance team productivity

Skills

Required

  • Python
  • deep learning frameworks
  • building complex systems
  • high-performance, large-scale ML systems
  • large-scale data processing
  • problem-solving skills
  • collaborative environment

Nice to have

  • MS or PhD
  • ML Accelerators
  • Kubernetes
  • pair programming

What the JD emphasized

  • proven track record of building complex systems
  • high-performance, large-scale ML systems
  • large-scale data processing
  • large-scale AI research projects

Other signals

  • developing the next generation of large language models
  • multimodal capabilities
  • large-scale ML systems
  • scaling distributed training jobs