Principal Engineer, Storage

Weights & Biases Weights & Biases · Data AI · Bellevue, WA · Technology

Principal Engineer role focused on designing, building, and operating the data plane for a high-performance AI storage platform. The role involves developing scalable, high-throughput storage solutions, optimizing performance and reliability, and collaborating with infrastructure and platform teams. Experience with object storage, distributed file systems, and systems programming languages is required.

What you'd actually do

  1. Design and implement a highly scalable multi-tenant control plane that supports CoreWeave’s growing AI storage and cloud infrastructure needs.
  2. Contribute to the development of exabyte-scale, S3-compatible object storage, distributed file system and integrate dedicated storage clusters into diverse customer environments.
  3. Work with technologies such as RDMA, GPU Direct Storage, RoCE, InfiniBand, SPDK, and distributed filesystems to optimize storage performance and efficiency.
  4. Participate in efforts to improve the reliability, durability, and observability of our storage stack.
  5. Collaborate with operations teams to monitor, analyze, and optimize storage systems using telemetry, metrics, and dashboards to improve performance, latency, and resilience.

Skills

Required

  • storage systems engineering
  • object storage
  • block storage
  • distributed file systems
  • Go
  • C
  • Rust
  • cloud-native infrastructure
  • Kubernetes
  • scalable system architecture
  • debugging
  • problem-solving

Nice to have

  • RDMA
  • GPU Direct Storage
  • RoCE
  • InfiniBand
  • SPDK
  • HDD
  • SSD

What the JD emphasized

  • storage systems engineering
  • object storage, block storage or distributed filesystems
  • systems programming language such as Go, C, or Rust
  • cloud-native infrastructure, Kubernetes, and scalable system architecture
  • debugging and problem-solving skills in distributed, high-performance environments