New AI postings mentioning Reinforcement Learning (RL) per week — 391 total over 12 weeks.
641 active AI roles across 96 companies mention Reinforcement Learning (RL). Category: ML Techniques.
Reinforcement Learning (RL) is a skill in the "ML Techniques" category. It currently appears in 641 active AI roles across 96 companies in our index.
The top employers with active AI roles mentioning Reinforcement Learning (RL) are: Amazon (159), Google (72), NVIDIA (47), Anthropic (26), OpenAI (21).
Over the last 12 weeks, 391 new AI postings mentioned Reinforcement Learning (RL). Demand is rising — up 139% in the last four weeks compared to the earliest four weeks in the window.
Roles requiring Reinforcement Learning (RL) are concentrated in: agents (38%), post-training (24%), application (17%). These stages follow a seven-stage AI lifecycle from data preparation through to shipped product.
Job postings that mention Reinforcement Learning (RL) most often also require: Machine Learning, Python, Large Language Models (LLMs), Agentic Systems, Software Engineering.
12 AI roles requiring this skill.
| Company | Title | Sector | AI score | Stage |
|---|---|---|---|---|
| Crusoe | Senior Director, AI Model LifeCycle | Data AI | 9 | L2 |
| Baseten | Post-Training Research Engineer | Data AI | 9 | L2 |
| Crusoe | Senior Staff Software Engineer, AI Model LifeCycle | Data AI | 9 | L2 |
| Crusoe | Senior Software Engineer, AI Model LifeCycle | Data AI | 9 | L2 |
| Crusoe | Staff Software Engineer, AI Model LifeCycle | Data AI | 9 | L2 |
| Scale AI | Engineering Manager, AgentOps | Data AI | 9 | L4 |
| Together AI | AI Researcher, Core ML (Turbo) | Data AI | 9 | L3 |
| Modal | Forward Deployed Engineer - ML | Data AI | 8 | L3 |
| Fireworks AI | Applied Machine Learning Engineer | Data AI | 8 | L6 |
| Weights & Biases | Staff Technical Program Manager - Cluster Orchestration & Applied Training | Data AI | 7 | L3 |
| Baseten | Software Engineer - Training Product | Data AI | 7 | L2 |
| Scale AI | Senior Software Engineer, GenAI | Data AI | 7 | L0 |