Senior Engineer, Storage Control Plane

Weights & Biases Weights & Biases · Data AI · Warsaw, Poland · Technology

Senior Storage Engineer, Control Plane to design, build, and operate the control plane for a high-performance AI storage platform. The role involves evolving storage systems by building reliable, scalable, and high-throughput solutions for AI workloads, with close collaboration across infrastructure, compute, and platform teams to ensure storage services scale automatically and seamlessly while maximizing performance and reliability.

What you'd actually do

  1. Design and implement a highly scalable multi-tenant control plane that supports CoreWeave’s growing AI storage and cloud infrastructure needs.
  2. Contribute to the development of exabyte-scale, S3-compatible object storage, distributed file system and integrate dedicated storage clusters into diverse customer environments.
  3. Work with technologies such as RDMA, GPU Direct Storage, RoCE, InfiniBand, SPDK, and distributed filesystems to optimize storage performance and efficiency.
  4. Participate in efforts to improve the reliability, durability, and observability of our storage stack.
  5. Collaborate with operations teams to monitor, analyze, and optimize storage systems using telemetry, metrics, and dashboards to improve performance, latency, and resilience.

Skills

Required

  • storage systems engineering or infrastructure
  • object storage or distributed filesystems in production environments
  • S3, NFS
  • Ceph, DAOS
  • Go, C, or Rust
  • storage observability tools and telemetry pipelines
  • cloud-native infrastructure, Kubernetes, and scalable system architecture
  • debugging and problem-solving skills in distributed, high-performance environments

Nice to have

  • RDMA
  • GPU Direct Storage
  • RoCE
  • InfiniBand
  • SPDK
  • ClickHouse
  • Prometheus
  • Grafana