AI Frontier · AI lab
Currently tracking 199 active AI roles, down 29% versus the prior 4 weeks. Primary focus: Agent · Engineering. Salary range $230k–$555k (avg $372k).
OpenAI currently has 235 active AI-related job listings. The majority of these roles are in the application stage, accounting for 32% of the total, followed closely by the agents stage at 29%. The dominant function for hiring is Engineering, with 168 positions. Frequent tech tags include model_serving, evals, and agent_orchestration, suggesting a focus on deploying and managing AI models. In the last 30 days, OpenAI posted 50 new AI roles, representing a 14% increase compared to the previous 30-day period.
OpenAI currently has 254 active AI-related roles in our index. The most common open titles are: AI Deployment Engineer (4), Partner AI Deployment Engineer - AWS (4), AI Deployment Engineer - Startups (3), AI Deployment Engineer- Codex (3), AI Deployment Engineer, Startups (2). Most positions are in Engineering and Research.
OpenAI's active AI hiring is concentrated in: application (33%), agents (28%), serving infrastructure (10%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
OpenAI is hiring AI talent in: United States (203 roles), United Kingdom (14 roles), Japan (6 roles), Germany (5 roles).
Job postings at OpenAI most frequently reference: model serving, agent orchestration, evals, llm observability, inference infra.
In the past 30 days, OpenAI has posted 56 new AI-related roles.
| Title | Stage | AI score |
|---|---|---|
| Researcher, Misalignment Research Researcher focused on identifying, quantifying, and understanding future AGI misalignment risks. The role involves designing worst-case demonstrations, developing adversarial and system-level evaluations, creating automated red-teaming infrastructure, researching alignment technique failure modes, and publishing findings to influence safety strategy and product safeguards. | Eval Gate | 10 |
| Research Engineer, Frontier Evals & Environments Research Engineer focused on building environments and methodologies for measuring and steering frontier AI models towards safe AGI/ASI, influencing training and launch decisions. | Eval GatePost-train |
| 10 |
| Researcher, Safety & Privacy Researcher focused on designing and building privacy-preserving safety systems for frontier AI models, involving auditable mechanisms for harm detection and mitigation while preserving user data privacy. The role aims to scale automated safety systems to minimize human review and address frontier risks. | Eval GatePost-train | 9 |
| Researcher, Automated Red Teaming This role leads the Automated Red Teaming (ART) effort, focusing on building scalable, research-driven systems to uncover failure modes in AI models and safeguards. The goal is to translate these findings into actionable improvements and reduce expected harm by identifying weaknesses early and reliably. The role involves research into automated classifier jailbreak discovery, bio threat-development elicitation, and CoT monitoring evasion probing, with a strong emphasis on applied research, evaluations, and building scalable automation. | Eval GateAgent | 9 |
| Researcher, Frontier Biological and Chemical Risks Researcher focused on identifying, tracking, and preparing for catastrophic risks related to frontier AI models, with a specific emphasis on biological, chemical, and cyber risks. The role involves designing and building evaluations for frontier AI models, contributing to risk management strategies, and ensuring the scientific validity of preparedness capability evaluations. | Eval Gate | 9 |
| Researcher, Safety Oversight Researcher focused on AI safety, specifically developing methods for oversight of frontier AI models, identifying and mitigating misuse and misalignment, and improving models' reasoning about human values. The role involves developing AI monitor models, designing red-teaming pipelines, and collaborating with cross-functional teams. | Eval GatePost-train | 9 |
| Researcher, Trustworthy AI Researcher focused on AI safety and societal impacts, translating policy problems into technical research, building methods for public input into model values, and increasing rigor of external assurances for AI model deployments. | Eval Gate | 9 |
| People Research Data Scientist, AI Fairness & Bias This role focuses on establishing how OpenAI evaluates AI-assisted People systems and talent processes by designing and conducting rigorous assessments to identify, measure, and mitigate potential bias across models, agents, and automated workflows. It involves defining fairness strategies, conducting algorithmic audits, evaluating human-AI decision systems, developing approaches for generative AI and agents, investigating sources of disparities, and building scalable fairness-evaluation infrastructure. | Eval GateAgent | 8 |
| Strategic Partnerships Lead, Education Research Scientist focused on building scientific and evaluation infrastructure to understand how AI systems affect learning, cognition, and capability development over time. The role involves designing rigorous studies, developing scalable evaluation methods, and measuring cognitive outcomes beyond engagement. It sits at the intersection of learning science, cognitive science, experimental design, LLM evaluation, and applied product research, with an initial focus on young users and education settings. This is an applied, empirical role focused on building evidence systems that are scientifically credible, operationally useful, and influential in model and product development. | Eval GatePost-train | 8 |
| Product Manager, Bio Safety Product Manager focused on biosecurity risk for OpenAI's frontier models. The role involves driving initiatives to ensure safe and impactful deployments, developing safety-focused product roadmaps, and collaborating with cross-functional teams. Key responsibilities include creating scalable evaluation and improvement systems, integrating AI safety research, and defining metrics for safety performance. | Eval Gate | 8 |
| Data Scientist, Safety Systems The Data Scientist, Safety Systems role focuses on establishing a data-driven approach to understand, evaluate, and monitor the safety of production AI systems. This involves defining and implementing metrics, creating dashboards, and collaborating with researchers and engineers to ensure safe AI deployment. The role emphasizes leadership in quantitative analysis and metric operationalization within a safety-focused team. | Eval GateData | 8 |
| Data Scientist, Integrity Measurement This role focuses on developing and implementing AI-first methods for measuring and monitoring complex harms on OpenAI's platforms. The Data Scientist will own measurement and metrics for severe usage harms, build productionised safety metrics, optimize LLM prompts for measurement, and leverage agentic products for automation. The role is crucial for ensuring the integrity and security of OpenAI's scaling technology. | Eval GateData | 8 |
| Data Scientist, Preparedness Data Scientist role focused on evaluating, improving, and building mitigation systems to prevent extreme harms from AI. This involves deep error analysis, root cause investigation, building monitoring frameworks, and identifying trends in blocking effectiveness to influence product and policy changes. The role requires strong analytical skills, SQL/Python proficiency, and experience in high-stakes domains like security or trust & safety. | Eval Gate | 8 |
| Technical Program Manager – Adversarial Model Research This role focuses on testing the safety and robustness of AI models through evaluations, red-teaming, and identifying failure modes. It involves leading programs to understand model behaviors, translating risks into research plans, and collaborating with research and engineering teams to integrate findings into model development and deployment cycles. The goal is to strengthen model reliability and public trust. | Eval GatePost-train | 8 |
| Backend Software Engineer (Evals) Backend Software Engineer to design and build an evals infrastructure for measuring the quality of OpenAI's support automation. The role involves building robust systems and backend services, integrating data, and collaborating with data science and research partners. Experience with AI agents, LLM evaluation methods, and distributed systems is required. | Eval Gate | 8 |
| Product Manager, Cyber Safety Product Manager for OpenAI's Safety Systems team, focusing on cybersecurity risk for frontier AI models. The role involves driving initiatives to ensure responsible deployment, developing safety roadmaps, and collaborating with research, engineering, and policy teams. Requires experience in AI safety, trust & safety, or integrity, with a background in cybersecurity and strong product management skills. | Eval Gate | 7 |
| AI Emerging Risks Analyst This role focuses on identifying and mitigating emerging risks and potential harms associated with frontier AI technologies. The analyst will use strategic foresight, quantitative and qualitative methodologies to scan signals, detect abuse patterns, and translate findings into actionable intelligence and risk mitigation proposals. Key responsibilities include mapping risks, building early warning systems, and contributing to product safety readiness. | Eval Gate | 7 |
| Product Manager, Safety Measurement Product Manager for Safety Measurement at OpenAI, responsible for defining the strategy and roadmap for platforms that measure harm and safeguard efficacy in production. This role involves partnering with research and engineering teams to integrate AI safety research into measurement products and represent quantitative safety progress to leadership. | Eval Gate | 7 |
| Data Scientist, Integrity Measurement Data Scientist focused on measuring and responding to adversarial threats and misuse on OpenAI's platforms, developing AI-first methods for prevalence estimation and productionised safety metrics, and optimizing LLM prompts for measurement. This role involves owning measurement and metrics for severe harm verticals, informing improvements to detection and enforcement, and leveraging agentic products for automation. | Eval Gate | 7 |