AI Frontier · AI lab
Currently tracking 199 active AI roles, down 29% versus the prior 4 weeks. Primary focus: Agent · Engineering. Salary range $230k–$555k (avg $372k).
OpenAI currently has 235 active AI-related job listings. The majority of these roles are in the application stage, accounting for 32% of the total, followed closely by the agents stage at 29%. The dominant function for hiring is Engineering, with 168 positions. Frequent tech tags include model_serving, evals, and agent_orchestration, suggesting a focus on deploying and managing AI models. In the last 30 days, OpenAI posted 50 new AI roles, representing a 14% increase compared to the previous 30-day period.
OpenAI currently has 254 active AI-related roles in our index. The most common open titles are: AI Deployment Engineer (4), Partner AI Deployment Engineer - AWS (4), AI Deployment Engineer - Startups (3), AI Deployment Engineer- Codex (3), AI Deployment Engineer, Startups (2). Most positions are in Engineering and Research.
OpenAI's active AI hiring is concentrated in: application (33%), agents (28%), serving infrastructure (10%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
OpenAI is hiring AI talent in: United States (203 roles), United Kingdom (14 roles), Japan (6 roles), Germany (5 roles).
Job postings at OpenAI most frequently reference: model serving, agent orchestration, evals, llm observability, inference infra.
In the past 30 days, OpenAI has posted 56 new AI-related roles.
| Title | Stage | AI score |
|---|---|---|
| Technical Lead Manager - Training Runtime, Data(set) Movement Technical Lead Manager for Training Runtime, focusing on the Data Movement area. This role owns the infrastructure for supplying training jobs with data and managing model state during large-scale model training runs. It involves designing and building a unified dataset read platform, defining APIs, storage contracts, and ensuring reliability and reproducibility. | Data | 9 |
| RE/RS, Data Understanding - Foundations This role focuses on research and development of high-quality datasets for large model training at OpenAI. Responsibilities include synthesizing data, building VQ representations, and processing/filtering data. The role involves treating data quality as a research problem, developing new methods for data selection and transformation, and designing experiments to understand data's impact on model learning. The goal is to translate research into scalable data processing pipelines. |
| 9 |
| RE/RS, Data Understanding (MM) This role focuses on preparing, curating, synthesizing, and understanding multimodal data (images, audio, video) at scale for large model training. It involves research and production problems related to data pipelines, quality filters, and using models for data preparation, with an emphasis on measuring dataset impact on model performance. | Data | 9 |
| Research Infrastructure Engineer, Training Systems This role is for a Research Infrastructure Engineer focused on ML training systems at OpenAI. The engineer will build and maintain the infrastructure that enables novel research ideas for large-scale model training, improving reliability, debuggability, and performance. The work involves debugging across various systems (Python, PyTorch, distributed systems, GPUs, networking, storage) and designing APIs for complex training workflows. | Data | 9 |
| Machine Learning Engineer, Distributed Data Systems - Robotics Machine Learning Engineer focused on designing and scaling distributed data infrastructure for large-scale multimodal training and evaluation in robotics, ensuring reliability and efficiency for rapid iteration cycles. | DataEval Gate | 9 |
| Software Engineer, Platform Systems Software Engineer on the Platform Systems team at OpenAI, responsible for designing and building distributed systems for failure detection, tracing, and observability in large-scale AI training jobs. This role focuses on operating and improving the reliability and performance of OpenAI's training infrastructure. | DataServe | 9 |
| Software Engineer, Distributed Data Systems - Robotics Software Engineer to design and scale infrastructure for large-scale multimodal training and evaluation in robotics at OpenAI. Focus on distributed data pipelines, ML infrastructure, and ensuring scalability and reliability. | DataEval Gate | 9 |
| Training: ML Framework Engineer This role focuses on improving the core distributed machine-learning training runtime for OpenAI, aiming to accelerate researchers and enable frontier-scale model runs. The engineer will work on high-performance data movement, fault-tolerant training frameworks, and distributed process management to increase both training throughput and researcher throughput. | DataPretrain | 9 |
| Training Performance Engineer The Training Performance Engineer will drive efficiency improvements across OpenAI's distributed training stack, focusing on optimizing GPU utilization, throughput, and uptime for large-scale distributed model training. This role involves profiling, analyzing performance bottlenecks in compute, communication, and storage, and collaborating with engineers to improve kernel efficiency, scheduling, and collective communication performance, particularly for pre-training. | Data | 9 |
| RE / RS - Foundations, Search Research role focused on embedding retrieval and agentic search, developing foundational technology for future frontier models. Involves designing new embedding training objectives, scalable vector store architectures, and dynamic indexing methods, with potential for publication and integration into OpenAI products. | DataAgent | 9 |
| Software Engineer, Collective Communication Software Engineer focused on the collective communication stack for large-scale AI model training, using C++ and CUDA to optimize network performance on custom supercomputers. This role directly supports the training of OpenAI's flagship models and collaborates with ML researchers. | Data | 9 |
| Software Engineer, ML Systems & Training Architecture Software Engineer focused on ML Systems & Training Infrastructure for the OpenAI Robotics team, responsible for maintaining and improving the training framework, debugging ML systems, and unblocking researchers and engineers. | Data | 8 |
| Training, Process Management Engineer OpenAI's Training Runtime team is seeking a Process Management Engineer to build and maintain the distributed OS responsible for launching, coordinating, and supervising large-scale machine learning training workloads. This role involves working with Rust and Python to create high-performance, reliable, and observable systems that enable researchers to scale ideas from experiments to production training runs on massive clusters. | Data | 8 |
| Software Engineer, Platform Systems Software Engineer on the Platform Systems team at OpenAI, responsible for designing and building distributed systems for failure detection, tracing, and observability to support large-scale AI model training. The role focuses on improving the reliability, performance, and visibility of OpenAI's training infrastructure. | Data | 8 |
| Software Engineer, Research - Human Data Software Engineer, Research - Human Data role at OpenAI. Focuses on building and maintaining full-stack systems for feedback collection, data labeling, and evaluation pipelines to train and improve AI models. Partners with researchers to translate alignment research into scalable production infrastructure, including inference and model training stacks. Designs user-facing tools and backend services for high-quality data workflows, driving infrastructure improvements for OpenAI's frontier models. | DataPost-train | 8 |
| Technical Lead Manager, Data Engineering, Trust & Safety Technical Lead Manager for Trust & Safety Data Engineering team at OpenAI. This role involves leading a team, setting strategy, shaping data architecture, and driving execution on high-impact data systems for fraud and abuse detection, safety measurement, and ML feature generation. The focus is on building privacy-safe datasets and pipelines to support trust and safety initiatives. | DataPost-train | 7 |
| Software Engineer, Productivity - Training Runtime Software Engineer focused on improving the developer experience and productivity within OpenAI's core training and inference frameworks. This role involves optimizing iteration speed, CI, and testing strategies to support researchers working on frontier experiments. | DataServe | 7 |
| Simulation Environments Engineer The role focuses on building tooling and infrastructure for creating high-coverage, realistic virtual environments for robotics research and evaluation. This involves designing content pipelines, authoring procedural generators, and shipping tools for environment creation, quality control, and integration with simulation farms and ML evaluation pipelines. The role sits at the intersection of game-engine practice, asset engineering, and large-scale simulation infrastructure. | DataEval Gate | 7 |
| Analytics Engineer, Safety Systems The Analytics Engineer on the Safety Systems team will build data-centric culture and enhance decision-making through analytics. They will partner with Engineering, Research, and Data Science to develop and maintain canonical data sources and dashboards for safety metrics, enabling trustworthy insights for people and AI agents. The role owns the consumption layer for safety metrics, defining ways for stakeholders to understand product safety, answer questions, and inform product decisions and company strategy. This role is crucial for advancing the goals of safe, robust, and reliable AI. | Data | 5 |
| Program Manager, Human Data The Human Data team at OpenAI is looking for a Program Manager to design and execute programs for collecting high-quality human feedback data. This role involves partnering with research, operations, and engineering teams, managing external vendors and AI trainers, and ensuring the successful completion of data campaigns. The goal is to use this data to train and evaluate AI models, contributing to the development of safe and effective AI systems. | Data | 5 |
| Technical Program Manager, Robotics Data Acquisition This role is for a Technical Program Manager focused on scaling the systems for robotic data acquisition (DAQ) for model training and evaluation. It involves integrating hardware, software, and operations to ensure high-quality data production for robotics development. | DataEval Gate | 5 |
| Software Engineer, Data Infrastructure - Research Software Engineer focused on building and scaling the dataset infrastructure for LLM training and inference, including multimodal data, at OpenAI. | Data | 5 |
| Data Engineer OpenAI is seeking a Data Engineer to build and manage data pipelines for user event data integration into their data warehouse. This role will develop canonical datasets for key product metrics, support safety systems, and collaborate with research teams to aid in new model training. The position requires expertise in data engineering technologies and distributed processing frameworks. | Data | 5 |