Manager, Infrastructure Strategy & Operations

Together AI Together AI · Data AI · San Francisco, CA · Business Operations

This role focuses on the strategy, operations, and analytical backbone for scaling compute infrastructure at an AI-native cloud company. It involves research, benchmarking, and decision frameworks for sourcing, evaluating, and deploying compute, with a focus on market intelligence, site comparisons, and operational analysis. Responsibilities include building dashboards for visibility into costs and utilization, developing comparison frameworks for sourcing decisions, and evaluating data center sites and energy options. The role requires strong quantitative skills, experience in high-growth startups or AI companies, and familiarity with AI productivity tools.

What you'd actually do

  1. Conduct strategic analysis on how to scale and deploy Together's compute infrastructure, translating complex operational data into clear, actionable recommendations for leadership.
  2. Build dashboards and reporting infrastructure that give the team real-time visibility into compute utilization, infrastructure costs, deployment status, and vendor pipelines.
  3. Identify opportunities to optimize how infrastructure is allocated and operated across workloads through compute utilization analysis.
  4. Develop and maintain comparison frameworks for infrastructure sourcing decisions (own vs. lease, location strategy, vendor selection, site evaluation) and synthesize vendor proposals and market data into decision-ready recommendations.
  5. Research and evaluate data center sites and energy sourcing options, comparing power availability, connectivity, permitting timelines, deployment readiness, and reliability.

Skills

Required

  • 5+ years of experience in management consulting, business operations, strategy, capacity planning or a similar analytically rigorous role.
  • Strong quantitative skills with a data-driven approach to decision-making and a track record of structuring ambiguous problems and building analyses from scratch.
  • Prior experience at a high-growth startup or AI company navigating rapid scaling.
  • Ability to learn new domains quickly and operate effectively in unfamiliar territory.
  • Comfort operating with ambiguity and speed; strong judgment on having directional answers.
  • Excellent communication skills with the ability to distill complex comparisons into clear, concise recommendations for technical and non-technical audiences.
  • Proficiency in Excel/Google Sheets
  • familiarity with SQL or data visualization tools
  • fluency with AI productivity tools (e.g., Claude Code, Codex)

Nice to have

  • Experience in cloud infrastructure, data center strategy , or GPU/compute procurement.
  • Familiarity with power markets, energy procurement, or renewable energy economics.
  • Background in infrastructure, energy, or hardware industries.

What the JD emphasized

  • scaling compute infrastructure
  • GPU/compute procurement
  • AI-native cloud company