Performance Modeling Engineer

OpenAI OpenAI · AI Frontier · San Francisco, CA · Scaling

Develop and apply performance modeling tools to evaluate AI system performance and inform architectural decisions for AI infrastructure. This role involves analyzing system behavior, running simulations, and quantifying tradeoffs across compute, memory, networking, and storage.

What you'd actually do

  1. Develop and maintain performance modeling tools and frameworks.
  2. Build models to evaluate system behavior across: - compute, memory, and interconnect subsystems - distributed system scaling and bottlenecks.
  3. Run simulations and analytical models to support architectural tradeoff analysis.
  4. Collaborate with performance modeling lead and system architects to answer forward-looking design questions.
  5. Analyze and interpret modeling outputs, translating results into actionable insights.

Skills

Required

  • Strong software engineering or modeling background
  • Familiarity with system architecture fundamentals
  • Experience with programming and building technical tools or frameworks
  • Ability to reason about performance bottlenecks and scaling behavior
  • Strong analytical skills and comfort working with quantitative models
  • Ability to collaborate across teams and learn new system domains quickly

Nice to have

  • Exposure to AI/ML workloads or distributed systems
  • Experience with simulation tools, performance modeling, or systems analysis
  • Familiarity with data center infrastructure or large-scale systems
  • Experience working with performance data, benchmarking, or profiling tools
  • Interest in system architecture and hardware/software co-design