641 active AI roles across 96 companies mention Reinforcement Learning (RL). Category: ML Techniques.
New AI postings mentioning Reinforcement Learning (RL) per week — 391 total over 12 weeks.
Reinforcement Learning (RL) is a skill in the "ML Techniques" category. It currently appears in 641 active AI roles across 96 companies in our index.
The top employers with active AI roles mentioning Reinforcement Learning (RL) are: Amazon (159), Google (72), NVIDIA (47), Anthropic (26), OpenAI (21).
Over the last 12 weeks, 391 new AI postings mentioned Reinforcement Learning (RL). Demand is rising: postings in the most recent four weeks are up 139% compared with the earliest four weeks in the window.
Roles requiring Reinforcement Learning (RL) are concentrated in three stages: agents (38%), post-training (24%), and application (17%). These stages belong to a seven-stage AI lifecycle that runs from data preparation through to shipped product.
Job postings that mention Reinforcement Learning (RL) most often also require: Machine Learning, Python, Large Language Models (LLMs), Agentic Systems, Software Engineering.
641 AI roles requiring this skill.
| Company | Title | Sector | AI score | Stage |
|---|---|---|---|---|
| xAI | Member of Technical Staff - Sandbox Service | AI Frontier | 7 | L4 |
| | Software Engineer III, AI/ML, Google Cloud AI | Big Tech | 7 | L3 |
| Roblox | [2026] Senior Machine Learning Engineer, Engine Optimization - PhD Early Career | Consumer | 7 | L3 |
| Cresta | Associate Forward Deployed Product Manager | Vertical AI | 7 | L4 |
| Braze | Forward-Deployed Data Scientist | Enterprise | 7 | L2 |
| Shield AI | Senior Staff Engineer, Autonomy - Tactical Behaviors (R4073) | Defense | 7 | L4 |
| Amazon | Sr. Applied Scientist, Prime Video - Personalization and Discovery Science | Big Tech | 7 | L6 |
| Scale AI | Senior Software Engineer, GenAI | Data AI | 7 | L0 |
| Walmart | Staff, Data Scientist | Retail | 7 | L4 |
| Figma | Software Engineer, Machine Learning | Enterprise | 7 | L4 |
| Netflix | Software Engineer 4/5 – AI for Member Systems | Big Tech | 7 | L6 |