Research Engineer / Research Scientist, Vision

Anthropic Anthropic · AI Frontier · San Francisco, CA · AI Research & Engineering

Research Engineer/Scientist focused on vision and spatial reasoning for LLMs, working on pretraining, RL, and runtime techniques like agentic harnesses. Involves developing and evaluating multimodal capabilities, creating benchmarks, and partnering with product teams to improve Claude models.

What you'd actually do

  1. Run experiments to evaluate architectural variants, data strategies, and SL and RL techniques to improve Claude’s vision
  2. Develop and test tools, skills, and agentic infrastructure that enable Claude to reason over visual inputs
  3. Create evaluations and benchmarks that measure progress on multimodal capabilities across training and deployment
  4. Work with our product org to find solutions to our most vexing API customer challenges related to vision and spatial reasoning

Skills

Required

  • ML
  • computer vision
  • software engineering
  • large vision language models
  • synthetic and real-world visual training datasets
  • systematic prompting
  • finetuning
  • evaluation

Nice to have

  • large-scale pretraining
  • SL
  • RL on language models
  • Deep learning research on images, video, or other modalities
  • Developing complex agentic systems using LLMs
  • High-performance ML systems (GPUs, TPUs, JAX, PyTorch)
  • Large-scale ETL and data pipeline development

What the JD emphasized

  • 7+ years of ML, computer vision, and software engineering experience
  • strong computer vision background
  • visual and spatial reasoning are core to fully unlocking the capabilities of LLMs

Other signals

  • research
  • vision
  • LLMs
  • evaluation
  • agentic