Senior Engineer, Storage Control Plane

Weights & Biases Weights & Biases · Data AI · Bellevue, WA +1 · Technology

Senior Engineer, Storage Control Plane at CoreWeave, focusing on designing, building, and operating a high-performance AI storage platform. The role involves developing scalable, multi-tenant control planes and optimizing storage systems for AI workloads, collaborating with infrastructure and platform teams.

What you'd actually do

  1. Design and implement a highly scalable multi-tenant control plane that supports CoreWeave’s growing AI storage and cloud infrastructure needs.
  2. Contribute to the development of exabyte-scale, S3-compatible object storage, distributed file system and integrate dedicated storage clusters into diverse customer environments.
  3. Work with technologies such as RDMA, GPU Direct Storage, RoCE, InfiniBand, SPDK, and distributed filesystems to optimize storage performance and efficiency.
  4. Participate in efforts to improve the reliability, durability, and observability of our storage stack.
  5. Collaborate with operations teams to monitor, analyze, and optimize storage systems using telemetry, metrics, and dashboards to improve performance, latency, and resilience.

Skills

Required

  • storage systems engineering or infrastructure
  • object storage or distributed filesystems in production environments
  • S3, NFS
  • Ceph, DAOS
  • Go, C, or Rust
  • cloud-native infrastructure
  • Kubernetes
  • scalable system architecture
  • debugging and problem-solving skills in distributed, high-performance environments
  • Clear communicator

Nice to have

  • RDMA
  • GPU Direct Storage
  • RoCE
  • InfiniBand
  • SPDK
  • ClickHouse
  • Prometheus
  • Grafana

What the JD emphasized

  • high-performance AI storage platform
  • exabyte-scale
  • high-throughput solutions
  • scale automatically and seamlessly
  • optimize storage performance and efficiency
  • reliability, durability, and observability
  • improve performance, latency, and resilience