Capacity and Infrastructure Lead

Baseten · Data AI · San Francisco, CA · G&A

This role builds the analytics foundation for tracking infrastructure usage, capacity, and cloud spend across Baseten's AI inference platform. The lead will create data models that unify cloud billing, usage, capacity, and telemetry data, partnering with Infrastructure Engineering and Finance to optimize cost and utilization. Responsibilities include building dashboards, modeling data from multiple providers, defining core metrics, supporting forecasting, developing anomaly alerting, and ensuring data reliability.

What you'd actually do

  1. Build, enhance, and maintain dashboards that track cloud cost, usage, capacity, utilization, and infrastructure efficiency across Baseten’s fleet.
  2. Ingest, clean, and model billing and usage data from multiple cloud and infrastructure providers, including sources such as cost and usage reports, provider APIs, invoices, and internal infrastructure systems.
  3. Create canonical data models for capacity and usage across a variety of dimensions and time grains.
  4. Partner with Infrastructure Engineering and Finance to define core metrics for cloud spend, committed capacity utilization, cost allocation, unit economics, and infrastructure efficiency.
  5. Support forecasting and planning workflows by modeling historical usage, capacity trends, and infrastructure demand.

Skills

Required

  • SQL
  • dbt
  • AWS Cost and Usage Reports
  • Google Cloud Billing Export
  • committed use discounts
  • savings plans
  • reservations
  • usage-based pricing
  • credits
  • cloud cost allocation
  • Python
  • Sigma
  • Hex

Nice to have

  • FOCUS standard or other cloud cost data normalization frameworks

What the JD emphasized

  • 5+ years of experience in analytics, BI, infrastructure analytics, cloud cost management, or a related role.
  • Strong SQL skills, including experience writing complex transformations across disparate datasets.
  • Experience building clean, reusable data models and semantic layers in dbt.
  • Working knowledge of concepts like AWS Cost and Usage Reports, Google Cloud Billing Export, committed use discounts, savings plans, reservations, usage-based pricing, credits, and cloud cost allocation.