Research Engineer, Machine Learning (horizons)

Anthropic Anthropic · AI Frontier · AI Research & Engineering

Research Engineer focused on advancing LLM capabilities and safety through fundamental research in reinforcement learning, improving reasoning (code, math), and exploring RL for agentic tasks. Involves developing novel RL techniques, creating tools for models to interact with, and designing experiments.

What you'd actually do

  1. Develop and implement novel reinforcement learning techniques to improve the performance and safety of large language models.
  2. Create tools and environments for models to interact with, enabling them to perform complex, open-ended tasks.
  3. Design and run experiments to enhance models' reasoning capabilities, particularly in code generation and mathematics

Skills

Required

  • Python
  • deep learning frameworks (PyTorch or Jax)
  • software engineering
  • pair programming
  • code quality
  • testing
  • performance

Nice to have

  • machine learning
  • reinforcement learning
  • high performance computing
  • virtualization
  • sandboxed code execution environments
  • Kubernetes
  • open-source contributions
  • published research papers

What the JD emphasized

  • 5+ years of industry-related experience
  • strong software engineering background
  • code quality, testing, and performance

Other signals

  • reinforcement learning
  • large language models
  • reasoning abilities
  • code generation
  • mathematics
  • agentic tasks