Research Engineer / Research Scientist, Tokens

Anthropic Anthropic · AI Frontier · San Francisco, CA · AI Research & Engineering

Research Engineer/Scientist role focused on building large-scale ML systems, touching all parts of code and infrastructure, from cluster reliability and job efficiency to running scientific experiments and improving dev tooling. The role involves optimizing ML systems, comparing model variants, scaling training jobs, and designing fault tolerance strategies, with a focus on safe, steerable, and trustworthy AI.

What you'd actually do

  1. You want to build large scale ML systems from the ground up.
  2. You care about making safe, steerable, trustworthy systems.
  3. As a Research Engineer, you'll touch all parts of our code and infrastructure, whether that's making the cluster more reliable for our big jobs, improving throughput and efficiency, running and designing scientific experiments, or improving our dev tooling.
  4. You're excited to write code when you understand the research context and more broadly why it's important.

Skills

Required

  • significant software engineering experience
  • results-oriented
  • flexibility and impact
  • pair programming
  • learn about machine learning research
  • societal impacts of your work

Nice to have

  • High performance, large-scale ML systems
  • GPUs, Kubernetes, Pytorch, or OS internals
  • Language modeling with transformers
  • Reinforcement learning
  • Large-scale ETL

What the JD emphasized

  • significant software engineering experience
  • high performance, large-scale ML systems
  • language modeling with transformers
  • reinforcement learning
  • large-scale ETL

Other signals

  • large scale ML systems
  • scientific experiments
  • optimizing throughput
  • compute efficiency
  • distributed training