RLHF
9 AI roles tagged rlhf.
Status
FilteredsectorAI Frontier×
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Anthropic | Anthropic AI Safety Fellow, UK | AI Frontier | 10 | Frontier research · Interpretability · Evals · Guardrails |
| xAI | Member of Technical Staff - Post-Training and RL | AI Frontier | 9 | RL post-training · Reward modeling · Fine-tuning |
| Anthropic | Anthropic Fellows Program — Reinforcement Learning | AI Frontier | 9 | RL post-training |
| Character AI | Research Engineer, Multimodal | AI Frontier | 9 | Fine-tuning · Multimodal · Vision · Audio & speech · Model serving · Inference infra · Synthetic data |
| Anthropic | Research Manager, Production Model Training | AI Frontier | 9 | Fine-tuning · Evals |
| xAI | Model Behavior Tutor - Style, Taste & Aesthetics | AI Frontier | 7 | Fine-tuning |
| xAI | Model Behavior Tutor - Wit & Conversation | AI Frontier | 7 | Evals · Fine-tuning |
| Anthropic | Data Operations Manager | AI Frontier | 7 | Agent orchestration · Tool use · Synthetic data |
| Mistral AI | Data Annotation Quality Specialist | AI Frontier | 5 | Synthetic data |