Senior Network Engineer, Deployment

Crusoe · Data AI · San Francisco, CA - US · Cloud Engineering

Crusoe is an AI infrastructure company that owns and operates its own hardware, from electrons to tokens. They are looking for a Senior Network Engineer to lead the physical and logical implementation of their global network, bringing new data centers and edge sites online for their high-performance compute and GPU-based AI infrastructure. This role involves executing global build-outs, bridging design and reality, validating and commissioning network infrastructure, optimizing deployment automation, managing on-site partners, and handling inventory and capacity management.

What you'd actually do

  1. Execute Global Build-outs: Lead the end-to-end deployment of network infrastructure in new and existing data centers, from initial rack/stack oversight to final hand-off.
  2. Bridge Design and Reality: Take high-level designs from the Network Development team and translate them into site-specific implementation plans, cable maps, and configuration templates.
  3. Validate and Commission: Perform rigorous "Burn-in" testing and site acceptance testing (SAT) for new network clusters, ensuring zero-defect handovers to the Operations team.
  4. Optimize Deployment Automation: Use Python, Ansible, and ZTP (Zero Touch Provisioning) to automate the staging and configuration of hundreds of network devices simultaneously.
  5. Manage On-site Partners: Coordinate with remote hands, structured cabling vendors, and data center providers to ensure physical layer standards (fiber paths, power requirements, and cooling) meet Crusoe’s stringent HPC requirements.

Skills

Required

  • 8+ years of experience in network engineering
  • large-scale data center deployments and infrastructure projects
  • Mastery of Physical Layer Standards: Expert knowledge of structured cabling (SMF/MMF, MPO/MTP), optical transceivers (400G/800G), and data center power/cooling requirements.
  • Strong Routing and Switching Knowledge: Hands-on experience configuring Arista (EOS), Juniper (Junos), and NVIDIA/Mellanox platforms in a leaf-spine architecture.
  • Protocol Proficiency: Solid understanding of BGP, EVPN-VXLAN, and LLDP as they relate to large-scale fabric provisioning.
  • Automation-First Mindset: Proficiency in Python and Ansible for automating repetitive deployment tasks and validating configuration state.
  • Logistical Excellence: Proven ability to manage multiple complex projects simultaneously across different time zones and physical locations.
  • Troubleshooting Expertise: Ability to diagnose complex physical layer and link-layer issues using OTDRs, light meters, and packet captures.
  • Bachelor’s degree in a technical field or equivalent practical experience in hyperscale or ISP environments.

What the JD emphasized

  • high-performance compute (HPC)
  • GPU-based AI infrastructure
  • network infrastructure
  • data centers
  • edge sites
  • physical and logical implementation
  • deployment standards
  • network devices
  • physical layer standards
  • HPC requirements
  • backbone capacity
  • edge interconnects
  • large-scale data center deployments
  • infrastructure projects
  • structured cabling
  • optical transceivers
  • data center power/cooling requirements
  • Routing and Switching Knowledge
  • leaf-spine architecture
  • fabric provisioning
  • deployment tasks
  • complex projects