Evals

By function: Engineering 1466 · Research 384 · Product 247

2097 AI roles tagged evals.

Company	Title	Sector	AI score	Other tags
Anthropic	Research Engineer / Research Scientist, Biology & Life Sciences	AI Frontier	9	Fine-tuning · RL post-training · Frontier research · Agent research
Anthropic	Research Engineer / Scientist, Tool Use Safety	AI Frontier	9	Agent orchestration · Tool use · Guardrails · RL post-training · Agent research · Fine-tuning · LLM observability
Sierra	Software Engineer, Agent (New Grad)	AI Frontier	9	Agent orchestration · RAG · Model serving · LLM observability
OpenAI	Forward Deployed Engineer - Munich	AI Frontier	9	Agent orchestration · Model serving · Inference infra · LLM observability
OpenAI	Forward Deployed Engineer - Paris	AI Frontier	9	Model serving · Inference infra · LLM observability · Agent orchestration
OpenAI	Forward Deployed Engineer - Dublin	AI Frontier	9	Agent orchestration · Model serving · Inference infra · LLM observability
Perplexity	Member of Technical Staff (Software Engineer, Applied AI)	AI Frontier	9	Agent orchestration · Recommender systems · Search & ranking · Fine-tuning · LLM observability
Shield AI	Product Manager, AI Platforms (R4991)	Defense	9	Multimodal · Training infra · Synthetic data · Inference infra · Model serving
Scale AI	Machine Learning Research Scientist, Reasoning	Data AI	9	Agent orchestration · Agent research · Fine-tuning · LLM observability
Glean	Machine Learning Engineer, AI Assistant & Autonomous AI Agents	Enterprise	9	Agent orchestration · Agent research · Inference infra · Model serving
ByteDance	Senior Software Engineer - AI for Security, Data/Application	Big Tech	9	Interpretability · RAG · LLM observability
Vectara	Senior Machine Learning Engineer	Data AI	9	RAG · Agent orchestration · LLM observability · Multimodal · Fine-tuning
Datadog	AI Research Engineer - Datadog AI Research (DAIR)	Enterprise	9	Multimodal · RL post-training · Agent orchestration · Frontier research · Model serving · Inference infra
Decagon	Senior Research Engineer	Vertical AI	9	Agent orchestration · Fine-tuning · Model serving · RAG · LLM observability
Sierra	Software Engineer, Agent (Spanish speaking)	AI Frontier	9	Agent orchestration · Model serving · RAG · LLM observability
Sierra	Software Engineer, Agent (French speaking)	AI Frontier	9	Agent orchestration · Model serving · RAG · Agent research
Sierra	Software Engineer, Agent (German speaking)	AI Frontier	9	Agent orchestration · Model serving · RAG
Datadog	AI Research Engineer - Datadog AI Research (DAIR)	Enterprise	9	Multimodal · Frontier research · RL post-training · RLHF · Agent orchestration · Model serving · Inference infra · Synthetic data
Anthropic	Research Engineer / Scientist, Robustness	AI Frontier	9	RL post-training · Agent research · Guardrails · LLM observability · Frontier research
OpenAI	Forward Deployed Engineer (FDE) - SF	AI Frontier	9	LLM observability · Model serving
OpenAI	Research Engineer, Codex	AI Frontier	9	Agent orchestration · Code gen · Agent research · Inference infra · Model serving
Anthropic	Research Engineer / Scientist, Tool Use	AI Frontier	9	Agent orchestration · Tool use · RL robotics · Guardrails · Fine-tuning · Model serving
Anthropic	Research Engineer, Model Performance & Quality	AI Frontier	9	LLM observability · Fine-tuning · RL post-training · Model serving
Anthropic	Research Engineer, Virtual Collaborator	AI Frontier	9	RL post-training · Fine-tuning
Anthropic	Research Scientist / Engineer, Agentic Learning (Horizons)	AI Frontier	9	Fine-tuning · RL post-training · Synthetic data
Cerebras	LLM Inference Performance & Evals Engineer	Semiconductors	9	Inference infra · Model serving
Anthropic	Research Engineer / Scientist, Model Welfare	AI Frontier	9	Interpretability
Anthropic	Research Engineer, Model Performance & Quality	AI Frontier	9	LLM observability · Fine-tuning · RL post-training · Model serving
Abridge	Machine Learning Scientist (All Levels)	Vertical AI	9	Fine-tuning · Model serving
Cohere	Senior Research Engineer, Model Evaluation	AI Frontier	9	LLM observability · Fine-tuning