Software Engineer, Hardware

OpenAI OpenAI · AI Frontier · San Francisco, CA · Scaling

Software Engineer role focused on building and optimizing the low-level stack for AI supercomputing clusters, including runtimes, kernels, and compiler infrastructure, to orchestrate computation and data movement for distributed training workloads.

What you'd actually do

  1. Design and build APIs and runtime components to orchestrate computation and data movement across heterogeneous ML workloads.
  2. Contribute to compiler infrastructure, including the development of optimizations and compiler passes to support evolving hardware.
  3. Engineer and optimize compute and data kernels, ensuring correctness, high performance, and portability across simulation and production environments.
  4. Profile and optimize system bottlenecks, especially around I/O, memory hierarchy, and interconnects, at both local and distributed scales.
  5. Develop simulation infrastructure to validate runtime behaviors, test training stack changes, and support early-stage hardware and system development.

Skills

Required

  • Rust
  • Python
  • systems programming
  • compiler development
  • kernel authoring
  • accelerator programming
  • runtime systems
  • distributed systems
  • high-performance simulation

Nice to have

  • C++

What the JD emphasized

  • low-level stack
  • orchestrates computation and data movement
  • distributed training workloads
  • systems programming
  • ML infrastructure
  • high-performance computing
  • ergonomic developer APIs
  • highly efficient runtime systems
  • compiler development
  • kernel authoring
  • runtime systems
  • distributed systems
  • high-performance simulation

Other signals

  • AI workloads
  • AI-native silicon
  • AI models
  • ML infrastructure
  • distributed training workloads
  • ML workloads
  • evolving hardware
  • ML system demands