Evals

Function: Engineering 1073 · Research 253 · Product 182

1508 AI roles tagged evals.

Company	Title	Sector	AI score	Other tags
Zillow	AI Applied Scientist - PhD Intern, Generative Computer Vision	Consumer	9	Vision · Multimodal · Fine-tuning
Anthropic	Research Engineer, Virtual Collaborator (Cowork)	AI Frontier	9	RL post-training · Reward modeling · Synthetic data
xAI	Member of Technical Staff - Mid-training	AI Frontier	9	Synthetic data · Multimodal · RL post-training
Scale AI	Machine Learning Research Engineer, Agent Data Foundation - Enterprise GenAI	Data AI	9	Synthetic data · RL post-training · Agent orchestration · Fine-tuning
Scale AI	Engineering Manager, AgentOps	Data AI	9	Agent orchestration · Agent research · Guardrails · RL post-training
OpenAI	Researcher, Pretraining Safety	AI Frontier	9	Pretraining · Frontier research · Model serving
Fireworks AI	Member of Technical Staff, Evals & Post-Training Product	Data AI	9	Fine-tuning
Zillow	AI Applied Scientist - PhD Intern, Foundational IQ	Consumer	9	Fine-tuning · Multimodal · Agent orchestration
Zillow	AI Applied Scientist - PhD Intern, 3D Computer Vision	Consumer	9	Vision · Multimodal · Fine-tuning
OpenAI	Offensive Security Engineer, Agent Products	AI Frontier	9	Agent orchestration · Tool use · Guardrails · Model serving · Inference infra
Gusto	Sr. Staff AI/ML Engineer	Fintech	9	Agent orchestration · RAG · Model serving · LLM observability · Guardrails
Cohere	Senior Research Scientist, Model Evaluation	AI Frontier	9	LLM observability · Fine-tuning
Wayve	Machine Learning Engineer	Robotics	9	Embodied AI · Model serving · Inference infra · Synthetic data · Fine-tuning
Anthropic	ML/Research Engineer, Safeguards	AI Frontier	9	Agent orchestration · Guardrails · Synthetic data · Agent research
Anthropic	Research Operations & Strategy Lead - Coding & Cybersecurity Data	AI Frontier	9	Agent research · Agent orchestration · Fine-tuning · RL post-training
Anthropic	Data Operations Manager - Computer Use & Tool Use	AI Frontier	9	Agent orchestration · RL post-training · Tool use · Agent research
Anthropic	Privacy Research Engineer, Safeguards	AI Frontier	9	Fine-tuning · RL post-training · Interpretability
Character AI	Research Engineer, AI Safety & Alignment	AI Frontier	9	Interpretability · RL post-training · Fine-tuning · Guardrails · LLM observability
OpenAI	Technical Lead, Safety Research	AI Frontier	9	RL post-training · Guardrails · Frontier research · Interpretability
OpenAI	Data Scientist, Codex	AI Frontier	9	Agent orchestration · Code gen
Anthropic	Research Engineer, Pretraining Scaling - London	AI Frontier	9	Pretraining · Model serving · Inference infra · LLM observability
Anthropic	Research Engineer / Research Scientist, Biology & Life Sciences	AI Frontier	9	Fine-tuning · RL post-training · Frontier research · Agent research
Anthropic	Research Engineer / Scientist, Tool Use Safety	AI Frontier	9	Agent orchestration · Tool use · Guardrails · RL post-training · Agent research · Fine-tuning · LLM observability
Sierra	Software Engineer, Agent (New Grad)	AI Frontier	9	Agent orchestration · RAG · Model serving · LLM observability
OpenAI	Forward Deployed Engineer - Munich	AI Frontier	9	Agent orchestration · Model serving · Inference infra · LLM observability
OpenAI	Forward Deployed Engineer - Paris	AI Frontier	9	Model serving · Inference infra · LLM observability · Agent orchestration
OpenAI	Forward Deployed Engineer - Dublin	AI Frontier	9	Agent orchestration · Model serving · Inference infra · LLM observability
Perplexity	Member of Technical Staff (Software Engineer, Applied AI)	AI Frontier	9	Agent orchestration · Recommender systems · Search & ranking · Fine-tuning · LLM observability
Shield AI	Product Manager, AI Platforms (R4991)	Defense	9	Multimodal · Training infra · Synthetic data · Inference infra · Model serving
Scale AI	Machine Learning Research Scientist, Reasoning	Data AI	9	Agent orchestration · Agent research · Fine-tuning · LLM observability