Capacity Engineer, Compute

Anthropic Anthropic · AI Frontier · Compute

This role focuses on capacity engineering and cloud infrastructure strategy for an AI company, advising on budget, planning, and performance. It involves developing tools for engineers to understand costs and efficiency, designing automation for capacity planning, and building cost-to-serve analytics. The role also involves managing vendor relationships and identifying infrastructure inefficiencies.

What you'd actually do

  1. Develop self-service tools and processes to enable anthropic engineers to understand their capacity, efficiency, and costs
  2. Design, develop, and lead necessary automation to help capacity plan for both near and long term outcomes
  3. Institute and design governance workflows to help manage additional capacity request approvals
  4. Investigate new capacity requests to ensure the best use of resources and that instances are sized appropriately
  5. Build and drive cost to serve analytics programs to guide engineering, finance, and leadership on the total cost (TCO) and infrastructure impact of our scaling factors. Inform pricing conversations through customer profile sensitive gross margin analysis.

Skills

Required

  • capacity engineering
  • technical role experience
  • public cloud providers
  • data modeling for public cloud
  • budgeting
  • capacity planning
  • cloud efficiency optimization
  • scripting
  • automation tools
  • cloud compute, storage, network, and services

Nice to have

  • Attention to detail
  • passion for correctness

What the JD emphasized

  • 5+ years experience in capacity engineering
  • 5+ years experience in a technical role