Research Engineer, Science of Scaling

Anthropic Anthropic · AI Frontier · London, United Kingdom · AI Research & Engineering

Research Engineer/Scientist on the Science of Scaling team, focused on developing next-generation large language models. The role involves research at the intersection of cutting-edge research and practical engineering, contributing to safe, steerable, and trustworthy AI systems. Responsibilities include research into the science of converting compute into intelligence, leading research projects, designing and analyzing experiments, optimizing training infrastructure, and developing dev tooling. Requires significant software engineering experience, proficiency in Python and deep learning frameworks, and a results-oriented approach. Strong candidates may have experience with JAX, reinforcement learning, high-performance ML systems, accelerators, Kubernetes, OS internals, transformer architectures, large-scale ETL, and distributed training at scale.

What you'd actually do

  1. Conduct research intro the science of converting compute into intelligence
  2. Independently lead small research projects while collaborating with team members on larger initiatives
  3. Design, run, and analyze scientific experiments to advance our understanding of large language models
  4. Optimize training infrastructure to improve efficiency and reliability
  5. Develop dev tooling to enhance team productivity

Skills

Required

  • Python
  • deep learning frameworks
  • software engineering
  • building complex systems
  • research
  • experiment design
  • ML infrastructure optimization
  • dev tooling development

Nice to have

  • JAX
  • reinforcement learning
  • high-performance ML systems
  • accelerators
  • Kubernetes
  • OS internals
  • language modeling
  • transformer architectures
  • large-scale ETL
  • distributed training at scale

What the JD emphasized

  • significant software engineering experience
  • proven track record of building complex systems
  • advanced degree (MS or PhD) in Computer Science, Machine Learning, or a related field
  • proficient in Python and experienced with deep learning frameworks
  • results-oriented with a bias towards flexibility and impact
  • enjoy pair programming and collaborative work
  • willing to take on tasks outside your job description to support the team
  • view research and engineering as two sides of the same coin
  • seeking to understand all aspects of the research program to maximize impact
  • care about the societal impacts of your work
  • ambitious goals for AI safety and general progress
  • Experience with JAX
  • Experience with reinforcement learning
  • Experience working on high-performance, large-scale ML systems
  • Familiarity with accelerators, Kubernetes, and OS internals
  • Experience with language modeling using transformer architectures
  • Background in large-scale ETL processes
  • Experience with distributed training at scale (thousands of accelerators)

Other signals

  • developing the next generation of large language models
  • science of converting compute into intelligence
  • design, run, and analyze scientific experiments
  • optimize training infrastructure
  • distributed training at scale