18 AI roles tagged rlhf.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Anthropic | Anthropic AI Safety Fellow, UK | AI Frontier | 10 | Frontier research · Interpretability · Evals · Guardrails |
| Cognition | Research, Post-Training | Coding AI | 9 | RL post-training · Reward modeling · Evals · Agent research · Agent orchestration |
| Anthropic | Anthropic Fellows Program — Reinforcement Learning | AI Frontier | 9 | RL post-training |
| Character AI | Research Engineer, Multimodal | AI Frontier | 9 | Fine-tuning · Multimodal · Vision · Audio & speech · Model serving · Inference infra · Synthetic data |
| Capital One | Applied Researcher I (AI Foundations) | Banking | 9 | Pretraining · Fine-tuning · Frontier research · Vector DB |
| Capital One | Applied Researcher II | Banking | 9 | Fine-tuning · Frontier research · Vector DB · Pretraining |
| Canva | Senior Research Scientist - Reinforcement Learning, MoEs | Enterprise | 9 | RL post-training · Reward modeling · Agent orchestration · Tool use · Multimodal · Model serving · Frontier research · Evals |
| Cohere | Research Engineer | AI Frontier | 9 | Frontier research · Fine-tuning · Evals · Model serving · Agent orchestration |
| Datadog | AI Research Engineer - Datadog AI Research (DAIR) | Enterprise | 9 | Multimodal · Frontier research · RL post-training · Agent orchestration · Model serving · Inference infra · Evals · Synthetic data |
| Anthropic | Research Manager, Production Model Training | AI Frontier | 9 | Fine-tuning · Evals |
| Capital One | Applied Researcher I | Banking | 8 | Fine-tuning · Frontier research · Vector DB |
| Capital One | Applied Researcher I | Banking | 8 | Fine-tuning · Frontier research · Interpretability · Vector DB · Recommender systems · Model serving |
| Capital One | Applied Researcher II (AI Foundations) | Banking | 8 | Pretraining · Fine-tuning · Vector DB |
| Capital One | Applied Researcher I (AI Foundations) | Banking | 8 | Pretraining · Fine-tuning · Vector DB · Frontier research · Interpretability |
| Walmart | Senior, Data Scientist | Retail | 8 | Evals · Vision · Multimodal · Fine-tuning · Reward modeling |
| Handshake | AI Tutor, Electrochemistry & Functional Materials Specialist (contract), Handshake AI | Enterprise | 7 | Evals · Guardrails |
| Handshake | Mathematics PhDs - AI Trainer | Enterprise | 5 | Evals |
| Handshake | Music Producer - AI Trainer | Enterprise | 5 | Evals |