Senior Storage Engineer

Crusoe · Data AI · San Francisco, CA - US · Cloud Engineering

This role is for a Senior Storage Engineer responsible for the administration, scaling, and operational excellence of high-performance all-flash storage ecosystems (VAST Data or Pure Storage) that support AI training and HPC workloads. The engineer will ensure a sub-millisecond-latency, high-throughput data backbone for GPU clusters, manage data protection, and automate provisioning.

What you'd actually do

  1. Own the end-to-end management of VAST Data (Universal Storage) and Pure Storage (FlashBlade/FlashArray) environments, including initial setup, volume provisioning, and export management.
  2. Proactively monitor VAST and Pure clusters for IOPS, throughput, and latency bottlenecks, ensuring storage performance stays ahead of GPU demand.
  3. Execute software upgrades (Purity//FB, VAST OS), D-Node/C-Node expansions, and hardware refreshes with zero downtime for our AI customers.
  4. Manage snapshots, replication policies, and data reduction (deduplication/compression) strategies to optimize TCO while ensuring 100% data durability.
  5. Act as the lead technical point of contact for storage incidents, working directly with VAST and Pure support engineering to resolve complex fabric or metadata issues.

Skills

Required

  • 5–8+ years of experience in Storage Administration
  • 3+ years of hands-on experience managing VAST Data or Pure Storage
  • Deep understanding of NFS over RDMA, SMB, and NVMe-oF
  • Strong command of the Linux CLI
  • Understanding of how storage interacts with InfiniBand and RoCE fabrics
  • Proficiency in Python, Bash, or similar for automating volume creation, quota management, and reporting via storage APIs
  • Meticulous approach to capacity planning and documentation

Nice to have

  • Experience with Pure1 or VAST VMS/Insight
  • Familiarity with Slurm or Kubernetes (CSI) integration
  • Prior experience in a "Large Scale" environment (multi-petabyte footprints)

What the JD emphasized

  • 3+ years of hands-on experience managing VAST Data or Pure Storage