Reward modeling
4 AI roles tagged reward_modeling.
Status
FilteredsectorEnterprise×
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Canva | Senior Research Scientist - Reinforcement Learning, MoEs | Enterprise | 9 | RL post-training · RLHF · Agent orchestration · Tool use · Multimodal · Model serving · Frontier research · Evals |
| Canva | Senior Research Scientist - Reinforcement Learning, MoEs | Enterprise | 9 | RL post-training · Frontier research · Agent orchestration · Multimodal · Model serving · Fine-tuning · Evals · Agent research |
| Adobe | Senior Applied Scientist | Enterprise | 8 | Fine-tuning · RL post-training · Model serving · Multimodal · Vision |
| Replit | Product Lead, Growth Marketing | Enterprise | 5 | Agent orchestration · RAG · Vector DB · Fine-tuning · Inference infra · Model serving · Recommender systems · Search & ranking · Multimodal · Evals · Guardrails · LLM observability · Frontier research · Interpretability · Synthetic data · Agent research · RL post-training · RLHF · RL robotics · Embodied AI |