RLHF
7 AI roles tagged RLHF.
Filtered by sector: Enterprise
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Canva | Senior Research Scientist - Reinforcement Learning, MoEs | Enterprise | 9 | RL post-training · Reward modeling · Agent orchestration · Tool use · Multimodal · Model serving · Frontier research · Evals |
| Datadog | AI Research Engineer - Datadog AI Research (DAIR) | Enterprise | 9 | Multimodal · Frontier research · RL post-training · Agent orchestration · Model serving · Inference infra · Evals · Synthetic data |
| Moveworks | Senior Machine Learning Engineer II, NLU & Agentic AI | Enterprise | 9 | Agent orchestration · Agent research · Fine-tuning · Evals · Multimodal · Model serving · LLM observability |
| Moveworks | Senior Machine Learning Engineer II, NLU & Agentic AI | Enterprise | 9 | Agent orchestration · Agent research · Fine-tuning · Evals · Multimodal · Model serving · LLM observability |
| ServiceNow | Staff Machine Learning Engineer, Agentic AI Systems - Moveworks | Enterprise | 8 | Agent orchestration · Tool use · Evals · Fine-tuning · Model serving · Agent research · LLM observability · Multimodal |
| Canva | Senior Machine Learning Engineer - Multimodal Data | Enterprise | 8 | Multimodal · Agent orchestration · Fine-tuning · Synthetic data · LLM observability |
| Replit | Product Lead, Growth Marketing | Enterprise | 5 | Agent orchestration · RAG · Vector DB · Fine-tuning · Inference infra · Model serving · Recommender systems · Search & ranking · Multimodal · Evals · Guardrails · LLM observability · Frontier research · Interpretability · Synthetic data · Agent research · RL post-training · Reward modeling · RL robotics · Embodied AI |