AI Frontier · AI lab
OpenAI currently has 235 active AI-related job listings. The largest share of these roles is in the application stage, at 32% of the total, followed closely by the agents stage at 29%. The dominant hiring function is Engineering, with 168 positions. Frequent tech tags include model_serving, evals, and agent_orchestration, suggesting a focus on deploying and managing AI models. In the last 30 days, OpenAI posted 50 new AI roles, a 14% increase over the previous 30-day period.
Currently tracking 199 active AI roles, down 29% versus the prior 4 weeks. Primary focus: Agent · Engineering. Salary range $230k–$555k (avg $372k).
OpenAI currently has 254 active AI-related roles in our index. The most common open titles are: AI Deployment Engineer (4), Partner AI Deployment Engineer - AWS (4), AI Deployment Engineer - Startups (3), AI Deployment Engineer- Codex (3), AI Deployment Engineer, Startups (2). Most positions are in Engineering and Research.
OpenAI's active AI hiring is concentrated in: application (33%), agents (28%), serving infrastructure (10%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
OpenAI is hiring AI talent in: United States (203 roles), United Kingdom (14 roles), Japan (6 roles), Germany (5 roles).
Job postings at OpenAI most frequently reference: model serving, agent orchestration, evals, LLM observability, inference infra.
In the past 30 days, OpenAI has posted 56 new AI-related roles.
| Title and description | Stage | AI score |
|---|---|---|
| Researcher, Training - London Researcher focused on pushing the frontier of LLM development, enhancing intelligence, efficiency, and adding new capabilities through research into architecture design, long-context, efficient attention, optimization, and scaling. The role involves designing, prototyping, and scaling new architectures, executing and analyzing experiments, optimizing model and computational performance, and contributing to training and inference infrastructure. Experience with major LLM training runs and evaluating deep learning architectures is desired. | Pretrain | 10 |
| Researcher, Context - Agent Post-Training Researcher focused on post-training of frontier AI agents, improving scaling of compute on context, and owning end-to-end improvements to the post-training stack including RL, data pipelines, graders, reward signals, and evals. The role involves building evals and environments, partnering with product teams, and working on early-training and alignment interventions to shape agent behavior and ship improvements into products. | Post-train, Agent | 10 |
| Researcher, Connectors - Agent Post-Training This role focuses on the post-training of frontier AI agents, specifically teaching them to interface with professional software and tools using code. The researcher will design and run experiments to improve agentic behavior, own improvements to the post-training stack (including RL, data pipelines, graders, reward signals, and evals), build evaluation environments, and work on early-training and alignment interventions. The goal is to enable agents to take useful actions across a user's digital context by connecting them with productivity and enterprise software, ultimately shipping improvements into products. | Post-train, Agent | 10 |
| Researcher, Computer Use - Agent Post-Training Research role focused on post-training frontier agents, teaching models to operate computers, navigate systems, use tools, and complete complex workflows. Involves designing experiments, owning post-training stacks (RL, data, graders, reward signals, evals), building evals and environments, partnering with product teams, and working on early-training and alignment interventions. | Post-train, Agent | 10 |
| Researcher, Misalignment Research Researcher focused on identifying, quantifying, and understanding future AGI misalignment risks. The role involves designing worst-case demonstrations, developing adversarial and system-level evaluations, creating automated red-teaming infrastructure, researching alignment technique failure modes, and publishing findings to influence safety strategy and product safeguards. | Eval Gate | 10 |
| Researcher, Loss of Control Researcher focused on mitigating loss of control risk in frontier AI models, designing and implementing an end-to-end mitigation stack for preventing, monitoring, detecting, containing, and enforcing against intentionally subversive or insufficiently controllable model behavior. This involves integrating safeguards across products and research, evaluating technical trade-offs, collaborating with risk modeling and evaluations teams, and executing rigorous testing and red-teaming workflows against advanced AI behaviors like sandbagging, monitor evasion, exploit-seeking, unsafe tool use, or strategic deception. | Agent, Eval Gate | 10 |
| Senior Research Engineer/Scientist - On-Device Transformer Models Research Engineer/Scientist focused on developing and optimizing on-device transformer models, including multimodal capabilities, for future computing devices. The role involves training and evaluating models, developing novel architectures, and translating research into practical applications, with a focus on performance optimization and rigorous scientific methodology. | Post-train, Serve | 10 |
| Researcher, Synthetic RL Research Scientist role focused on developing novel reinforcement learning techniques using synthetic data, environments, and feedback to train and evaluate frontier AI models, with a focus on generalization and alignment. The role involves designing experiments, analyzing training dynamics, and integrating research into production pipelines. | Post-train, Data | 10 |
| Research Engineer, Frontier Evals & Environments - Finance OpenAI is seeking a Research Engineer for the Frontier Evals team, focusing on building evaluations for AI models in the finance domain. The role involves identifying crucial financial capabilities, designing quantification methods, owning a research agenda for evaluation development, and refining frontier model assessments. This position is critical for steering AI progress towards safe AGI/ASI. | Eval Gate | 10 |
| Researcher, Interpretability Researcher focused on studying internal representations of deep learning models to understand model behavior and engineer more understandable representations, with a focus on AI safety and ensuring the safety of powerful AI systems. The role involves developing and publishing research, engineering infrastructure for studying model internals, and collaborating across teams. | Post-train | 10 |
| Research Engineer/Research Scientist, RL/Reasoning Research Engineer/Scientist focused on advancing AI alignment and capabilities using cutting-edge reinforcement learning methods to train intelligent, aligned, and general-purpose agents. The role involves pushing the boundaries of RL research, building next-generation generative models, and deploying them at scale, with a focus on core reasoning paradigms and innovations. | Post-train, Agent | 10 |
| Researcher, Training Researcher focused on developing and scaling new LLM architectures to improve intelligence and efficiency, with contributions to training and inference infrastructure. Requires deep understanding of LLM architectures and an empirical approach. | Pretrain, Serve | 10 |
| Research Engineer, Frontier Evals & Environments Research Engineer focused on building environments and methodologies for measuring and steering frontier AI models towards safe AGI/ASI, influencing training and launch decisions. | Eval Gate, Post-train | 10 |
| Research Scientist Research Scientist roles at OpenAI focusing on developing innovative machine learning techniques and advancing the research agenda, with a focus on discovering generalizable ideas and contributing to a broad research vision. The role emphasizes owning a research agenda and pursuing long-running projects. | Pretrain | 10 |
| Distributed Training Engineer, Sora OpenAI is seeking a Distributed Training Engineer for their Sora team to improve training throughput for video models. This role involves optimizing the internal training framework, collaborating with researchers, and ensuring hardware efficiency for supercomputer training runs. | Pretrain | 10 |
| Research Engineer / Research Scientist -Personal AGI, Proactivity Research Engineer/Scientist focused on improving model proactivity and personalization for a collaborative assistant, involving RL, dataset creation, evaluations, and post-training methods, with a strong emphasis on product-driven research and collaboration with product teams. | Post-train, Agent | 9 |
| Forward Deployed Engineer - Stockholm This role focuses on deploying OpenAI's frontier AI models into production systems for strategic customers, involving end-to-end ownership from discovery and design to rollout and adoption. The engineer will build full-stack systems, contribute code, and codify working patterns into reusable tools, with success measured by production adoption, workflow impact, and feedback that influences product and model roadmaps. The role requires strong engineering and deployment experience, particularly with LLMs or generative models, and the ability to manage complex projects in ambiguous environments. | Ship, Agent | 9 |
| Forward Deployed Engineer - Madrid This role focuses on deploying OpenAI's frontier AI models into production systems for strategic customers, leading the entire process from discovery and scoping to system design, build, and rollout. The engineer will work on full-stack systems, embed with customer teams, and contribute to product and model roadmaps through feedback and evaluations. The role requires strong engineering and deployment experience, particularly with LLMs, and the ability to manage complex projects in ambiguous environments. | Ship, Serve | 9 |
| Researcher, Agent Post-Training, Personality Researcher focused on post-training of AI agents to improve their collaborative personality, involving behavioral research, data creation, reward modeling, and collaboration with product teams to ship improved agent models. | Post-train, Agent | 9 |
| Technical Lead Manager - Training Runtime, Data(set) Movement Technical Lead Manager for Training Runtime, focusing on the Data Movement area. This role owns the infrastructure for supplying training jobs with data and managing model state during large-scale model training runs. It involves designing and building a unified dataset read platform, defining APIs, storage contracts, and ensuring reliability and reproducibility. | Data | 9 |
| Researcher: Agent Post-Training, API & Power-Users This role focuses on training frontier agents for OpenAI's products, including Codex, ChatGPT, and the API. The researcher will improve agent capabilities, reliability, and product fit for power users and API developers by designing experiments, building training environments, and developing post-training interventions. The role involves working across research, engineering, data, evals, and product to shape agentic model behavior for real-world workflows and API-based applications. | Post-train, Agent | 9 |
| RE/RS, Data Understanding - Foundations This role focuses on research and development of high-quality datasets for large model training at OpenAI. Responsibilities include synthesizing data, building VQ representations, and processing/filtering data. The role involves treating data quality as a research problem, developing new methods for data selection and transformation, and designing experiments to understand data's impact on model learning. The goal is to translate research into scalable data processing pipelines. | Data, Pretrain | 9 |
| RE/RS, Data Understanding (MM) This role focuses on preparing, curating, synthesizing, and understanding multimodal data (images, audio, video) at scale for large model training. It involves research and production problems related to data pipelines, quality filters, and using models for data preparation, with an emphasis on measuring dataset impact on model performance. | Data | 9 |
| Software Engineer, Cyber Frontier Software Engineer role focused on building AI systems and products for cyber threat understanding and response, improving safety and reliability of frontier models in security-sensitive settings. Responsibilities include defining and executing technical roadmaps, working with defenders and customers, shaping model training and access patterns, and building research and evaluation systems. | Ship, Eval Gate | 9 |
| Software Engineer, RL Training Infra Software Engineer focused on the infrastructure and engineering challenges of large-scale reinforcement learning training for frontier AI agents, including scaling, orchestration, inference, and reliability, with a secondary focus on agentic capabilities like multi-agent systems and memory. | Post-train, Agent | 9 |
| Researcher, Artifacts - Agent Post-Training Researcher focused on post-training frontier agent models to create polished work products like documents and spreadsheets. Owns improvements across RL, data pipelines, graders, reward signals, and evals. Partners with product teams to translate user needs into model improvements and ships capabilities into products. | Post-train, Agent | 9 |
| Manager, AI Deployment Engineering - Codex Manager for AI Deployment Engineering focused on the Codex product, responsible for leading a team that helps customers integrate and scale AI coding tools into their software development lifecycle. The role involves technical leadership, customer engagement strategy, and partnering with product and research teams to drive adoption and gather feedback. | Ship | 9 |
| Security Preparedness Lead, Coding Agents This role focuses on the security preparedness of internal AI coding and research agents, defending them against cyber threats and insider risks. It involves developing threat models, identifying critical security investments, and leading technical execution for security controls. | Agent | 9 |
| Security Researcher, Agentic AI Threats Research role focused on mitigating AI threats to global security, specifically by identifying and preparing for security threats from advanced internal AI agents. The role involves designing security controls and stress-testing defenses with AI agent evaluations. | Agent | 9 |
| AI Deployment Engineer - Startups AI Deployment Engineer role focused on working with strategic startup customers to optimize AI systems, identify failure modes, and translate learnings into product improvements and evaluation systems. This role involves prototyping prompts and agents, designing evaluations, and acting as a technical partner to customers. | Agent, Eval Gate | 9 |
| Performance & Systems Engineer, Codex The role focuses on optimizing the performance and cost of AI systems, specifically the Codex agents, which involve LLM inference, cloud orchestration, and agentic work management. The engineer will hunt down inefficiencies, build tooling for measurement and profiling, and collaborate to improve latency and cost. | Serve, Agent | 9 |
| Researcher, Alignment Training Researcher focused on studying and shaping aligned behavior in frontier AI models through various training stages (pre-training, mid-training, post-training). The role involves developing synthetic data methods, building evaluation loops, designing data generation pipelines, and creating experiments to distinguish durable learned behavior from artifacts. Collaboration with other teams is key to translate research insights into better model behavior. | Post-train, Data | 9 |
| Researcher, Alignment Science Research role focused on intent alignment for AI models, including instruction following, honesty, calibration, and robustness. Involves designing and running experiments, training models with RL, developing evaluations for failure modes, and integrating successful techniques into model development. Aims to produce publishable research and deployable techniques. | Post-train, Eval Gate | 9 |
| Research Infrastructure Engineer, Training Systems This role is for a Research Infrastructure Engineer focused on ML training systems at OpenAI. The engineer will build and maintain the infrastructure that enables novel research ideas for large-scale model training, improving reliability, debuggability, and performance. The work involves debugging across various systems (Python, PyTorch, distributed systems, GPUs, networking, storage) and designing APIs for complex training workflows. | Data | 9 |
| Software Engineer, Inference - Performance Optimization Software Engineer focused on optimizing inference performance across application, model, and fleet layers. This role involves building performance models, analyzing inference workloads, enhancing tooling for bottleneck identification, and collaborating with teams to implement improvements and project future needs. The core of the role is to drive faster and cheaper inference. | Serve | 9 |
| Manager, Forward Deployed Engineering - Munich Manager for Forward Deployed Engineering team responsible for partnering with customers to turn research breakthroughs into production systems using frontier models. The role involves leading and growing a team, owning end-to-end delivery outcomes, and ensuring fieldwork informs roadmap priorities and supports safe deployment at scale. Requires strong leadership, technical project management, and experience with production-grade code. | Ship, Serve | 9 |
| Manager, Forward Deployed Engineering - London Manager for Forward Deployed Engineering team responsible for partnering with customers to turn research breakthroughs into production systems, owning end-to-end delivery outcomes, and informing roadmap priorities through fieldwork. Requires experience managing customer-facing engineers and leading high-pressure technical projects. | Ship | 9 |
| Researcher, Agentic Post-Training Researcher focused on post-training agentic models, developing horizontal improvements for factuality, instruction following, tool use, and multi-agent collaboration. The role involves building and improving training/evaluation infrastructure and creating evals to ensure models are ready for shipment, directly impacting models used by millions. | Post-train, Agent | 9 |
| Machine Learning Engineer, API Multicloud Machine Learning Engineer to build and improve AI systems for strategic partners adapting OpenAI models to cloud-native environments. This role involves post-training workflows, evaluation, data pipelines, model behavior, and API/infrastructure integration, focusing on customizing and deploying models safely and reliably. | Post-train, Agent | 9 |
| Engineering Manager, Multimodal (API) Engineering Manager to lead a team responsible for delivering innovative multimodal API products, including real-time processing, speech transcription, speech generation, and image creation. The role involves owning the product roadmap, collaborating with research teams, and guiding technical decisions for scalability and robustness. | Ship, Post-train | 9 |
| AI Deployment Engineer \| Codex AI Deployment Engineer focused on helping customers adopt OpenAI's coding tools (Codex) by guiding them in integrating Codex into their projects and workflows, designing and validating advanced AI workflows, and building demos and integrations. The role involves technical consulting, customer-facing leadership, and contributing to product strategy. | Ship | 9 |
| Software Engineer, Foundations Retrieval Software Engineer focused on building and scaling retrieval systems for agentic search, enabling models to retrieve and act on information. This role involves designing and operating indexing systems, retrieval pipelines, and serving layers, with a focus on performance, reliability, and observability at scale. It supports retrieval across OpenAI products and research, integrating with pretraining, inference, and product teams. | Agent, Serve | 9 |
| Manager, AI Deployment Engineering - Codex Manager for AI Deployment Engineering focused on the Codex product, responsible for leading a team that helps customers integrate and scale AI coding tools into their software development lifecycle, ensuring reliable and secure adoption. | Ship | 9 |
| Technical Deployment Lead - Tokyo This role is a founding Technical Deployment Lead at OpenAI, focused on partnering with customers to deliver complex AI systems. The role involves defining delivery processes, translating business needs into technical plans, managing execution across engineering and research teams, and ensuring customer adoption and value realization. It requires deep technical project management, ownership, and the ability to work in ambiguous, high-autonomy environments, with a focus on shipping AI/LLM systems to customers and codifying reusable patterns from field insights. | Ship | 9 |
| Researcher, Safety & Privacy Researcher focused on designing and building privacy-preserving safety systems for frontier AI models, involving auditable mechanisms for harm detection and mitigation while preserving user data privacy. The role aims to scale automated safety systems to minimize human review and address frontier risks. | Eval Gate, Post-train | 9 |
| Forward Deployed Engineer - Sydney Forward Deployed Engineers (FDEs) lead complex end-to-end deployments of frontier models in production alongside strategic customers. This role involves discovery, technical scoping, system design, build, and production rollout, partnering directly with customer engineering and domain teams. Success is measured by production adoption, workflow impact, and feedback that influences product and model roadmaps. The FDE will contribute to both customer delivery and core platform development, working closely with various internal teams and potentially contributing code directly. The role requires strong engineering and deployment experience, particularly with LLMs or generative models, and the ability to manage complex systems in ambiguous environments. | Agent, Serve | 9 |
| Applied AI Engineer, Codex Core Agent This role focuses on improving the performance, reliability, and usefulness of AI agents, specifically for software engineering tasks. It involves designing agent behaviors, developing evaluation metrics, optimizing through prompting and tool-use, analyzing production failures, and building feedback loops to enhance models and agent capabilities. The goal is to bridge the gap between research potential and real-world application, ensuring agents are dependable tools. | Agent, Post-train | 9 |
| Software Engineer, Codex Core Agents Software Engineer focused on building and operating the production infrastructure for Codex agents, including sandboxed execution, orchestration, stateful workflows, and optimizing performance (tokens, latency, reliability, cost) for a fleet of AI agents. | Agent, Serve | 9 |
| AI Deployment Engineer, Startups AI Deployment Engineer focused on working with strategic startup customers to optimize their AI systems, prototype prompts/agents, and translate customer feedback into reproducible evaluations and product improvements for OpenAI's research and products. The role involves deep technical engagement, understanding system behavior, and building relationships within the startup ecosystem. | Agent, Eval Gate | 9 |
| TL, Research Inference This role focuses on building and optimizing high-performance inference systems for large-scale AI models, translating research ideas into efficient and scalable inference infrastructure. It involves owning core execution paths, distributed inference across multiple GPUs, and optimizing operators and kernels, with a strong emphasis on performance, correctness, and realism for research enablement. | Serve | 9 |