Software Engineer, Productivity - Model Performance

OpenAI · AI Frontier · San Francisco, CA · Scaling

Software Engineer focused on improving developer productivity for AI model performance teams, working on systems, tooling, and infrastructure for training and inference workloads. Contributes to CI/CD, testing pipelines, and developer experience.

What you'd actually do

  1. Improve development workflows for engineers working on model performance infrastructure
  2. Design and improve CI/CD, release, validation, and testing pipelines
  3. Build and maintain tools that improve reliability, iteration speed, and engineering confidence
  4. Partner closely with engineers to identify friction in testing, debugging, deployment, and development workflows
  5. Contribute to infrastructure efforts that support performance-critical training and inference systems

Skills

Required

  • Python
  • CI/CD
  • developer infrastructure
  • testing systems
  • tooling
  • build/release workflows

Nice to have

  • PyTorch ecosystem
  • C++
  • Rust

What the JD emphasized

  • performance-critical engineering work
  • performance-critical training and inference systems
  • when you see repeated friction (slow tests, flaky CI, brittle release processes, painful debugging, unclear validation), your instinct is to fix the underlying system