Software Engineer, Compute - Storage

OpenAI OpenAI · AI Frontier · San Francisco, CA · Scaling

Software Engineer for Storage Infrastructure team at OpenAI, responsible for building and operating storage systems that support both research and production workloads. This role involves working with object storage, cross-region data movement, lifecycle management, and a federation layer, primarily using Rust and Kubernetes.

What you'd actually do

  1. Build and operate storage services that underpin OpenAI’s research infrastructure
  2. Develop object storage systems across cloud and in-house environments
  3. Build systems for cross-region data movement, replication, and recovery
  4. Design lifecycle management capabilities that keep data durable, available, and cost-effective
  5. Evolve the federation layer that unifies multiple backend systems behind a simple interface

Skills

Required

  • experience building or operating distributed systems in production
  • worked on storage infrastructure, object stores, distributed filesystems, or other data-intensive backend systems
  • owning infrastructure end to end, including debugging and long-term reliability improvements
  • strong production code, ideally in Rust or another systems-oriented language
  • comfortable working with Kubernetes-based systems

Nice to have

  • experience with tools such as Terraform, Grafana, or similar infrastructure and observability tooling