Staff + Senior Software Engineer, Inference

Anthropic Anthropic · AI Frontier · San Francisco, CA · Software Engineering - Infrastructure

Software Engineer focused on building and maintaining the distributed systems that serve large language models (like Claude) to millions of users. The role involves maximizing compute efficiency, enabling research through high-performance inference infrastructure, and integrating new AI hardware and model architectures.

What you'd actually do

  1. Design, build, and maintain the distributed systems that serve Claude to millions of users worldwide
  2. Develop intelligent request routing, load balancing, and traffic management systems across thousands of accelerators
  3. Maximize compute efficiency across the fleet by autoscaling and orchestrating production, research, and experimental workloads
  4. Build and operate production-grade deployment pipelines for releasing new models to users
  5. Provide high-performance inference infrastructure that enables researchers to develop next-generation models

Skills

Required

  • Significant software engineering experience
  • distributed systems
  • Python
  • Rust

Nice to have

  • high-performance, large-scale distributed systems
  • implementing and deploying machine learning systems at scale
  • load balancing, request routing, or traffic management systems
  • LLM inference optimization, batching, and caching strategies
  • Kubernetes and cloud infrastructure (AWS, GCP, Azure)

What the JD emphasized

  • maximizing compute efficiency
  • enabling breakthrough research
  • high-performance inference infrastructure
  • new AI accelerator platforms
  • new model architectures
  • LLM inference optimization

Other signals

  • Serving Claude to millions of users worldwide
  • Maximizing compute efficiency
  • Enabling breakthrough research by giving scientists high-performance inference infrastructure
  • Integrating new AI accelerator platforms
  • Supporting inference for new model architectures