Manager, Field Operations Coordination - Spark

Crusoe · Data AI · Denver, CO - US · Strategic Initiatives

This role is for a Manager, Field Operations Coordination at Crusoe, an AI infrastructure company. The primary focus is on ensuring the operational reliability and uptime of their fleet of modular data centers (Spark). Responsibilities include coordinating field operations, ensuring operational readiness through playbooks, managing logistics and safety protocols, and driving alignment across internal teams and cross-functional partners. The role requires experience in data center operations, industrial equipment, cross-functional leadership, logistics, and safety knowledge, particularly in mission-critical environments. While the company is in the AI infrastructure space, this specific role is focused on the operational management of the physical infrastructure rather than direct AI/ML development.

What you'd actually do

  1. Fleet Reliability & Uptime You will be the Spark point-person for uptime and reliability across the Spark fleet. You'll serve as the primary field operations interface with the broader business, driving alignment on best practices and working to standardize and simplify protocols across Spark deployments.
  2. Operational Readiness You will coordinate across teams to ensure operational playbooks are in place and kept current for both air-cooled and liquid-cooled Spark units (CDUs, cooling loops, leak detection). You'll work closely with field and training teams to ensure technicians are prepared and certified to support next-generation infrastructure, and ensure SLA commitments to managed services customers are met.
  3. Logistics & Safety You will partner with logistics and operations teams to ensure critical sparing strategies and staffing models are defined, communicated, and built to support 4-hour response times at remote sites. You will champion a zero-incident safety culture by ensuring OSHA and LOTO protocols are current, well-understood, and consistently followed across all field work.

Skills

Required

  • 7+ years of progressive career experience
  • 5+ years in data center operations or field services
  • managing reliability at scale
  • demonstrated cross-functional coordination
  • Demonstrated understanding of industrial equipment (UPS, transformers)
  • working knowledge of liquid-cooled (DLC) systems
  • Proven ability to drive alignment across teams
  • Experience building or coordinating sparing and staffing strategies for remote, time-sensitive operations
  • Understanding of OSHA standards and LOTO protocols for high-voltage and mission-critical environments
  • Comfortable operating in a fast-moving, high-growth organization

What the JD emphasized

  • own the operational reliability
  • ensure that the people, processes, and playbooks needed to keep our fleet running are in place, aligned, and understood across the organization
  • operational reliability
  • standardize and simplify protocols
  • ensure operational playbooks are in place
  • ensure SLA commitments to managed services customers are met
  • ensure critical sparing strategies and staffing models are defined, communicated, and built to support 4-hour response times at remote sites
  • champion a zero-incident safety culture
  • consistently followed across all field work
  • 7+ years of progressive career experience, and 5+ years in data center operations or field services, with a track record of managing reliability at scale and demonstrated cross-functional coordination
  • Demonstrated understanding of industrial equipment
  • working knowledge of liquid-cooled (DLC) systems
  • Proven ability to drive alignment across teams
  • Experience building or coordinating sparing and staffing strategies for remote, time-sensitive operations
  • Understanding of OSHA standards and LOTO protocols for high-voltage and mission-critical environments
  • Comfortable operating in a fast-moving, high-growth organization where priorities shift and roles evolve