Staff Product Manager, Compute (SF, Sunnyvale)

Crusoe · Data AI · San Francisco, CA - US · Product and Design

Product Manager responsible for defining and delivering Crusoe Cloud's core compute offerings, including GPU and CPU infrastructure, bare metal, virtualized compute, and orchestration layers that power AI training and inference workloads. This role owns the strategy and execution of compute capabilities that form the foundation of Crusoe's IaaS and Managed AI services.

What you'd actually do

  1. Define and execute the product strategy for Crusoe Cloud compute services, including GPU and CPU infrastructure, bare metal offerings, and virtualized compute environments supporting AI training and inference workloads
  2. Translate emerging AI workload requirements into product capabilities across performance, scheduling, isolation, utilization, and reliability
  3. Drive roadmap decisions across hardware platforms, including new GPU generations, server architectures, and cluster topologies, ensuring alignment between infrastructure investments and customer demand
  4. Partner with engineering and infrastructure teams to deliver fleet-level capabilities such as provisioning, lifecycle management, observability, and performance optimization across large-scale compute environments
  5. Collaborate with networking and storage product teams to ensure integrated infrastructure performance for distributed AI workloads

Skills

Required

  • Strong product management experience delivering infrastructure or platform products, ideally within cloud infrastructure, HPC, or AI/ML environments
  • Deep understanding of compute infrastructure concepts including virtualization, containerization, distributed systems, and large-scale cluster operations
  • Familiarity with GPU-based workloads and AI infrastructure, including training and inference characteristics, scheduling challenges, and performance considerations
  • Experience working across hardware and software boundaries, translating infrastructure capabilities into customer-facing products
  • Ability to balance technical tradeoffs, customer requirements, and infrastructure economics when making product decisions
  • Strong analytical skills with the ability to interpret utilization, performance, and financial metrics to guide roadmap prioritization
  • Experience collaborating across engineering, operations, finance, and GTM teams to deliver complex infrastructure products
  • Strong communication skills with the ability to articulate technical concepts and connect them to customer and business outcomes

Nice to have

  • Experience working with large-scale GPU fleets or AI infrastructure platforms
  • Familiarity with Kubernetes, Slurm, or other workload orchestration systems used for AI and HPC workloads
  • Experience building products for AI-native customers or operating AI workloads directly
  • Background in infrastructure economics, capacity planning, or capital-intensive product environments

What the JD emphasized

  • AI training and inference workloads
  • GPU-based workloads and AI infrastructure
  • Large-scale GPU fleets or AI infrastructure platforms