← Tag co-occurrence network

Reward modeling

2 AI roles tagged reward_modeling.

Company	Title	Sector	AI score	Other tags
Zillow	Principal Applied Scientist, Agentic AI	Consumer	9	RL post-training · RLHF · Fine-tuning · Guardrails · Agent orchestration · Evals · Multimodal · Vector DB
Whatnot	Software Engineer, Trust & Risk	Consumer	7	Agent orchestration · Evals · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Inference infra · Model serving · Recommender systems · Search & ranking · Vision · Audio & speech · Frontier research · Interpretability · Synthetic data · Agent research · RL post-training · RLHF · RL robotics · Embodied AI