48 active AI roles across 20 companies mention Proximal Policy Optimization (PPO). Category: ML Techniques.
New AI postings mentioning Proximal Policy Optimization (PPO) per week — 39 total over 13 weeks.
51 AI roles requiring this skill.
| Company | Title | Sector | AI score | Stage |
|---|---|---|---|---|
| Black Forest Labs | Member of Technical Staff - Pretraining | Multimodal | 10 | L1 |
| OpenAI | Researcher, Loss of Control | AI Frontier | 10 | L4 |
| Amazon | Applied Scientist, Safe RL, Robotics, SAF Lab | Big Tech | 9 | L6 |
| Amazon | Senior Applied Scientist, AGI Customization | Big Tech | 9 | L2 |
| CrowdStrike | Data Scientist, Agentic Systems (Remote) | Enterprise | 9 | L4 |
| | Research Engineer, Embodied Agents, DeepMind | Big Tech | 9 | L4 |
| CrowdStrike | Director, Model Post-Training and Agentic Research (Remote) | Enterprise | 9 | L2 |
| Deloitte | Research Engineer — Post-Training & Small Language Models (SLMs), Healthcare AI | Consulting | 9 | L2 |
| Amazon | Senior Applied Scientist, Selling Partner Support Engagement | Big Tech | 9 | L4 |
| Amazon | Applied Scientist, Selling Partner Support Engagement | Big Tech | 9 | L4 |
| NVIDIA | Senior Quantum Applied Research Scientist, Calibration and Decoding | Semiconductors | 9 | L2 |
| Snowflake | Staff Research Scientist, Exotic AI | Data AI | 9 | L0 |
| Skydio | PhD Autonomy Engineer Intern - Planning & Controls (Reinforcement Learning) | Defense | 9 | L4 |
| Autodesk | Principal AI Research Scientist Post-Training Alignment | Enterprise | 9 | L2 |
| Autodesk | Principal AI Research Scientist Post-Training · Alignment · Reinforcement Learning Autodesk AI Lab: London · San Francisco · Toronto · Remote (US/CA/EU | Enterprise | 9 | L2 |
| Autodesk | Research Lead / Principal Scientist & Manager Post-Training · Alignment · Reinforcement Learning Autodesk AI Lab: Toronto · Remote (CA) | Enterprise | 9 | L2 |
| Amazon | Applied Scientist - Agentic AI, Amazon Fulfillment Technology | Big Tech | 9 | L4 |
| Amazon | Applied Scientist, Trustworthy Shopping Experience (TSE) | Big Tech | 9 | L4 |
| Mistral AI | Open-Source Software, Machine Learning Engineer | AI Frontier | 9 | L3 |
| NVIDIA | Senior Machine Learning and Simulation Engineer - Autonomous Vehicles | Semiconductors | 9 | L2 |
| Figure AI | Helix AI Engineer, Reinforcement Learning | Robotics | 9 | L2 |
| NVIDIA | Agent RL Infra Engineer | Semiconductors | 9 | L2 |
| OpenAI | Machine Learning Engineer, Integrity | AI Frontier | 9 | L2 |
| OpenAI | Researcher, Frontier Cybersecurity Risks | AI Frontier | 9 | L4 |
| OpenAI | Research Engineer / Machine Learning Engineer - Applied Voice | AI Frontier | 9 | L2 |
| Figma | AI Applied Scientist | Enterprise | 9 | L2 |
| Amazon | Sr. Software Engineer- AI/ML, AWS Neuron Distributed Training | Big Tech | 9 | L1 |
| OpenAI | Research Engineer, Applied AI Engineering | AI Frontier | 9 | L6 |
| AMD | Principal AI Software Engineer | Semiconductors | 8 | L3 |
| | Sr. Staff Machine Learning Engineer, Agentic Ads | Consumer | 8 | L4 |
Proximal Policy Optimization (PPO) is a skill in the "ML Techniques" category. It currently appears in 48 active AI roles across 20 companies in our index.
The top employers with active AI roles mentioning Proximal Policy Optimization (PPO) are: Amazon (14), NVIDIA (7), OpenAI (5), Autodesk (3), Pinterest (2).
Over the last 13 weeks, 39 new AI postings mentioned Proximal Policy Optimization (PPO). Demand is rising: postings in the last four weeks are up 171% compared with the earliest four weeks of the window.
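For readers who want to check a growth figure like this, here is a minimal sketch of the comparison as described: sum the postings in the earliest four weeks, sum the postings in the latest four weeks, and take the percent change. The function name `window_growth_pct` and the weekly series are hypothetical. The real per-week counts are not given here, so the numbers below were chosen only to be consistent with the stated totals (39 postings over 13 weeks) and should not be read as actual data.

```python
def window_growth_pct(weekly_counts, window=4):
    """Percent change of the last `window` weeks vs. the first `window` weeks."""
    if len(weekly_counts) < 2 * window:
        raise ValueError("need at least two non-overlapping windows")
    earliest = sum(weekly_counts[:window])
    latest = sum(weekly_counts[-window:])
    if earliest == 0:
        raise ValueError("growth is undefined when the earliest window is empty")
    return (latest - earliest) / earliest * 100


# Hypothetical weekly posting counts (NOT the real series): 13 weeks, 39 total.
weekly = [1, 2, 2, 2, 2, 3, 3, 2, 3, 4, 5, 5, 5]

growth = window_growth_pct(weekly)
print(f"{growth:.0f}%")  # earliest four weeks sum to 7, latest four to 19
```

With small weekly counts, a few extra postings in either window move the percentage a lot, so the figure is best read alongside the absolute totals.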
Roles requiring Proximal Policy Optimization (PPO) are concentrated in three stages: agents (33%), post-training (29%), and application (23%). These stages belong to a seven-stage AI lifecycle that runs from data preparation through to shipped product.