13 AI roles tagged reward_modeling.

| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Cognition | Research, Post-Training | Coding AI | 9 | RL post-training · RLHF · Evals · Agent research · Agent orchestration |
| Snorkel AI | Research Scientist - RL Training | Data AI | 9 | RL post-training · Fine-tuning · Synthetic data |
| Baseten | Post-Training Applied Researcher | Data AI | 9 | Fine-tuning · RL post-training · Agent orchestration · Tool use · Evals · Synthetic data · Model serving |
| OpenAI | Research Engineer/Scientist - Human Alignment, Consumer Devices | AI Frontier | 9 | RL post-training · Multimodal · Evals |
| Canva | Senior Research Scientist - Reinforcement Learning, MoEs | Enterprise | 9 | RL post-training · RLHF · Agent orchestration · Tool use · Multimodal · Model serving · Frontier research · Evals |
| Canva | Senior Research Scientist - Reinforcement Learning, MoEs | Enterprise | 9 | RL post-training · Frontier research · Agent orchestration · Multimodal · Model serving · Fine-tuning · Evals · Agent research |
| Anthropic | Research Engineer, Environment Scaling | AI Frontier | 9 | RL post-training · Fine-tuning · Synthetic data · Evals |
| Anthropic | Senior Research Scientist, Reward Models | AI Frontier | 9 | RL post-training · Evals · LLM observability · Frontier research |
| Anthropic | Research Engineer, Virtual Collaborator (Cowork) | AI Frontier | 9 | RL post-training · Synthetic data · Evals |
| Anthropic | Research Engineer, Reward Models | AI Frontier | 9 | RL post-training · Fine-tuning · LLM observability |
| Scale AI | Machine Learning Research Scientist, Post-Training | Data AI | 9 | RL post-training · Fine-tuning · Multimodal · Evals · Frontier research |
| Adobe | Senior Applied Scientist | Enterprise | 8 | Fine-tuning · RL post-training · Model serving · Multimodal · Vision |
| Walmart | Senior, Data Scientist | Retail | 8 | Evals · Vision · Multimodal · Fine-tuning · RLHF |