What you'd actually do

Design and implement a robust framework of Key Performance Indicators (KPIs) from scratch. You will define and track metrics for uptime, MTTR (Mean Time to Repair), deployment velocity, and power utilization, providing data-driven updates to executive leadership.

Act as the technical lead for the "white space" while maintaining a deep understanding of the specialized electrical and mechanical systems (UPS, PDUs, specialized cooling) that support our unique Sparks deployments.

Lead the operational rollout for Crusoe Sparks (NV) and the San Jose Lab (CA). Develop the roadmap for scaling operations as the West Coast Region expands.

Oversee the day-to-day maintenance of AI-optimized hardware. Drive rapid diagnostics, component replacement (GPU trays, DIMMs, etc.), and streamlined RMA processes across the region.

Bridge the gap between the San Jose Lab and our Crusoe Cloud production sites. Document deployment standards that allow seamless hardware transitions from experimental lab phases to large-scale production.

Skills

Required

8+ years in data center operations, managing distributed white space or lab environments across multiple locations.
Strong technical understanding of data center electrical and mechanical systems.
Demonstrated experience defining and building operational metrics.
Hands-on experience with enterprise-grade server architecture.
Experience operating in colocation or leased-space environments.
Willingness to travel between Crusoe Cloud data center locations as needed.

Nice to have

Specific experience with GPU-heavy clusters (NVIDIA/AMD) is highly preferred.

Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack — from electrons to tokens — to power the world's most ambitious AI workloads. When you join Crusoe, you join a team that is building the future, faster.

We're in the midst of the greatest industrial revolution of our time. The demand for AI compute is boundless, and power is a bottleneck. We're solving that — with an energy-first approach that makes AI infrastructure better for the world and faster for the people innovating with AI.

We're looking for problem-solving, opportunity-finding teammates with a sense of urgency, who believe in the scale of our ambition and thrive on a path not fully paved — people who want to grow their careers alongside a team of experts across energy, manufacturing, data center construction, and cloud services.

If you want to do the most meaningful work of your career, help our customers and partners advance their AI strategies, and be part of a high-performing team that believes in each other, come build with us at Crusoe.

About the Role

As the Senior Manager of Data Center Operations, you will be the operational anchor for our initial deployment in Sparks, Nevada, with immediate responsibility for all white-space hardware within the current and future Crusoe Sparks sites.

This is a role for a technical "builder." You will oversee the evolution of our West Coast Regional operations, including our high-density AI Lab in San Jose, California. You must possess a deep understanding of the specialized electrical and mechanical infrastructure that makes Crusoe Sparks unique. Beyond managing hardware, you will be responsible for defining, building, and reporting on the KPIs that will govern regional performance and operational transparency for senior leadership.

What You’ll Be Working On

KPI Architecture & Leadership Reporting: Design and implement a robust framework of Key Performance Indicators (KPIs) from scratch. You will define and track metrics for uptime, MTTR (Mean Time to Repair), deployment velocity, and power utilization, providing data-driven updates to executive leadership.
Infrastructure Oversight: Act as the technical lead for the "white space" while maintaining a deep understanding of the specialized electrical and mechanical systems (UPS, PDUs, specialized cooling) that support our unique Sparks deployments.
Regional Scale & Strategy: Lead the operational rollout for Crusoe Sparks (NV) and the San Jose Lab (CA). Develop the roadmap for scaling operations as the West Coast Region expands.
Hardware Lifecycle & Break-Fix: Oversee the day-to-day maintenance of AI-optimized hardware. Drive rapid diagnostics, component replacement (GPU trays, DIMMs, etc.), and streamlined RMA processes across the region.
Lab-to-Production Pipeline: Bridge the gap between the San Jose Lab and our Crusoe Cloud production sites. Document deployment standards that allow seamless hardware transitions from experimental lab phases to large-scale production.
Vendor & Landlord Relations: Act as the primary liaison for colocation landlords and utility partners. Hold them accountable to SLAs, ensuring the facility infrastructure meets the demanding requirements of our high-density AI clusters.
Team Leadership: Build, mentor, and scale a high-performing regional team of technicians. Foster a culture of technical precision, safety, and operational discipline.

What You’ll Bring to the Team

Proven Leadership: 8+ years in data center operations, managing distributed white space or lab environments across multiple locations.
Infrastructure Fluency: A strong technical understanding of data center electrical and mechanical systems. You can speak the language of facilities engineers and understand the unique constraints of high-density AI power and cooling.
Analytical Rigor: Demonstrated experience defining and building operational metrics. You have a track record of using data to tell a story and drive process improvements.
Deep Hardware Expertise: Hands-on experience with enterprise-grade server architecture; specific experience with GPU-heavy clusters (NVIDIA/AMD) is highly preferred.
The Multi-Site Mindset: Experience operating in colocation or leased-space environments. You know how to manage diverse landlord relationships to protect Crusoe’s operational interests.
Tactical Versatility: You are equally comfortable presenting high-level KPI dashboards to the VP of Operations as you are on the floor with a crash cart and a multimeter.
Mobility & Reliability: Willingness to travel between Crusoe Cloud data center locations as needed, and the flexibility to support critical hardware failures or deployment pushes.

Benefits:

Competitive compensation and equity packages
Restricted Stock Units
Paid time off, paid holidays & leave of absence programs
Comprehensive health, dental & vision insurance
Employer contributions to HSA account
Paid parental leave
Paid life insurance, short-term and long-term disability
Professional development & tuition reimbursement
Mental health & wellness support
Commuter benefits (parking & transit)
Cell phone stipend
401(k) Retirement plan with company match up to 4% of salary
Volunteer time off
Global travel insurance & emergency assistance
Daily meals allowance
Additional perks & programs specific to location

Compensation Range

Compensation will be paid in the range of $179,000 to $218,000, plus bonus. Restricted Stock Units are included in all offers. Compensation will be determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.

Senior Manager, Data Center Operations
