Senior Software Engineer, Storage

Crusoe · Data AI · San Francisco, CA - US · Cloud Engineering

Senior Software Engineer on the storage team responsible for designing, building, and optimizing next-generation cloud storage products. Focus on high-performance, reliable, and scalable distributed storage systems for AI/HPC workloads.

What you'd actually do

  1. Building core components of our foundational storage products, purpose-built for high-performance AI and ML workloads
  2. Contributing to distributed file, block, and object storage products, with a focus on filesystem-based solutions
  3. Designing and implementing high-performance, scalable, and resilient storage architectures that are highly extensible
  4. Proposing and prototyping novel strategies to scale performance and system throughput for our most demanding customer workloads
  5. Building observability, metrics and tooling for our services and fleet

Skills

Required

  • modern software development best practices
  • Go, Java, C/C++, or Rust
  • multi-tenant, cloud-scale distributed storage infrastructure software and systems
  • File (e.g., NFS, SMB, Lustre), Object, or Block Storage (e.g., NVMe, iSCSI)
  • high-performance filesystem-based products
  • VFS and Linux filesystems (e.g., ext4, XFS, ZFS)
  • Linux and its storage subsystems
  • monitoring tools (Prometheus, Grafana)
  • log analysis
  • distributed tracing
  • debugging

Nice to have

  • AI/HPC storage solutions, such as Parallel Filesystems or petabyte+ scale Object Storage
  • networking technologies like RDMA and InfiniBand
  • modern storage technologies (e.g., GPUDirect Storage, F2FS, SPDK)
  • NVIDIA SuperNIC DPUs for storage optimization
  • Storage Virtualization & Orchestration
  • volume placement strategies
  • distributed metadata management
  • Research publications or open-source contributions to storage-related projects

What the JD emphasized

  • high-performance AI and ML workloads
  • high-performance filesystem-based products
  • storage systems