Workload Porting & Performance Engineer

OpenAI OpenAI · AI Frontier · San Francisco, CA · Scaling

This role focuses on evaluating new hardware platforms by porting and analyzing performance of benchmarks and real-world workloads, identifying system bottlenecks, and adapting workloads to utilize hardware capabilities. It involves debugging across hardware and software boundaries, with a preference for experience in AI/ML workloads.

What you'd actually do

  1. Port and enable benchmarks and real-world workloads on new hardware platforms.
  2. Evaluate system performance across compute, memory, storage, and networking subsystems.
  3. Identify and analyze performance bottlenecks and inefficiencies.
  4. Adapt and optimize workloads to better utilize hardware capabilities.
  5. Develop and run performance experiments and profiling workflows.

Skills

Required

  • performance analysis
  • benchmarking
  • workload optimization
  • system architecture
  • CPU/GPU
  • memory
  • I/O subsystems
  • porting workloads
  • profiling tools
  • performance debugging techniques
  • root cause analysis
  • large-scale or distributed system environments

Nice to have

  • AI/ML workloads
  • training systems
  • inference systems
  • GPU or accelerator-based systems
  • low-level performance tools
  • compilers
  • runtime optimization
  • collaborating with hardware and architecture teams

What the JD emphasized

  • performance analysis
  • workload optimization
  • system-level debugging
  • hardware/software boundaries
  • AI/ML workloads