Engineering Manager, ML Performance and Scaling

Anthropic · AI Frontier · AI Research & Engineering

Engineering Manager for ML Performance and Scaling teams, focusing on optimizing inference and training systems, identifying bottlenecks, and maximizing efficiency. Requires management experience, background in ML/AI, and interest in safe AI development.

What you'd actually do

  1. Provide front-line leadership of engineering efforts to improve model performance and scale our inference and training systems
  2. Become familiar enough with the team’s technical stack to make targeted contributions as an individual contributor
  3. Manage day-to-day execution of the team's work
  4. Prioritize the team’s work and manage projects in a highly dynamic, fast-paced environment
  5. Coach and support your reports in understanding and pursuing their professional growth

Skills

Required

  • 1+ years of management experience in a technical environment
  • background in machine learning, AI, or a related technical field
  • building strong relationships with stakeholders
  • understanding and contributing to discussions on complex technical topics
  • managing teams through periods of rapid growth and change

Nice to have

  • Performance or distributed systems
  • High-performance, large-scale ML systems
  • GPU/accelerator programming
  • ML framework internals
  • OS internals
  • Language modeling with transformers

What the JD emphasized

  • performance and scaling
  • inference and training systems
  • AI safety

Other signals

  • performance and scaling
  • inference and training systems
  • identify and remove bottlenecks
  • maximize efficiency