Evals

Function

All Engineering · 1073 Research · 253 Product · 182

Status

Sort

1508 AI roles tagged evals.

Company	Title	Sector	AI score	Other tags
Grafana Labs	Staff AI Engineer \| US \| Remote	Data AI	9	Agent orchestration · Tool use · RAG · LLM observability · Guardrails
Snowflake	Staff Research Scientist, AI Agents & LLMs	Data AI	9	Agent orchestration · Agent research · Fine-tuning · Model serving · Inference infra
OpenAI	Applied AI Engineer, Codex Core Agent	AI Frontier	9	Agent orchestration · Tool use · Fine-tuning · LLM observability · Code gen
Intercom	Principal Engineer, Fin AI Agent	Enterprise	9	Agent orchestration · LLM observability · Model serving
Scale AI	Research Scientist, Frontier Risk Evaluations	Data AI	9	Agent orchestration · Guardrails · Frontier research · LLM observability
Adobe	Principal Architect, Express AI Foundations	Enterprise	9	Agent orchestration · Model serving · Inference infra · LLM observability · Multimodal
NVIDIA	Senior AI ML Solution Engineer, AI-Native Development	Semiconductors	9	Agent orchestration · Tool use · Fine-tuning · RAG · Code gen
Zillow	Principal Machine Learning Engineer, Agentic AI	Consumer	9	Agent orchestration · Multimodal · Guardrails · LLM observability · Model serving · Agent research
OpenAI	AI Deployment Engineer, Startups	AI Frontier	9	Agent orchestration · Model serving · Fine-tuning · LLM observability
Snowflake	AI Engineer - Cortex Code Quality	Data AI	9	Agent orchestration · Tool use · Guardrails · LLM observability · RAG · Code gen
Cohere	Member of Technical Staff, Safety for Agents	AI Frontier	9	RL post-training · Agent orchestration · Fine-tuning · LLM observability
Zillow	Principal Applied Scientist, Agentic AI	Consumer	9	RL post-training · RLHF · Reward modeling · Fine-tuning · Guardrails · Agent orchestration · Multimodal · Vector DB
Perplexity	Member of Technical Staff (Secure Intelligence Institute)	AI Frontier	9	Agent orchestration · Guardrails · Agent research
Cohere	Research Internship (Spring/Summer 2026)	AI Frontier	9	Frontier research · Pretraining · Fine-tuning · Multimodal · LLM observability
Scale AI	Research Scientist, Agent Robustness	Data AI	9	Agent orchestration · Agent research · Guardrails · RL post-training · Fine-tuning
Scale AI	Research Scientist, AI Controls and Monitoring	Data AI	9	LLM observability · Guardrails · Interpretability · RL post-training · Agent research
Canva	Staff Machine Learning Engineer - Integrations & Solutions Group (AU remote)	Enterprise	9	Agent orchestration · Tool use · LLM observability
Cohere	Product Manager, Agent Harness & Modelling	AI Frontier	9	Agent orchestration · Tool use · RAG · Agent research · Fine-tuning
Wayve	Principal Machine Learning Engineer, App SW	Robotics	9	Embodied AI · Model serving · Inference infra · Synthetic data · Fine-tuning
Baseten	Post-Training Applied Researcher	Data AI	9	Fine-tuning · RL post-training · Reward modeling · Agent orchestration · Tool use · Synthetic data · Model serving
Wayve	Machine Learning Engineer, AV Engineering	Robotics	9	Embodied AI · Fine-tuning · Synthetic data
Cresta	Senior Machine Learning Engineer - Voice Experience	Vertical AI	9	Audio & speech · Fine-tuning · Model serving · Inference infra · RAG · Agent orchestration · LLM observability
OpenAI	Machine Learning Engineer, Integrity	AI Frontier	9	Fine-tuning · Model serving · LLM observability · Guardrails
Walmart	Principal, Data Scientist	Retail	9	Agent orchestration · Tool use · RAG · Vector DB · Model serving · Inference infra
xAI	Member of Technical Staff - Voice Model	AI Frontier	9	Audio & speech · Fine-tuning · RL post-training · Inference infra · Model serving
Zillow	Senior Applied Scientist, Agentic AI	Consumer	9	Agent orchestration · Tool use · Fine-tuning · LLM observability · Agent research
Ramp	Agentic Operator, Growth Marketing	Fintech	9	Agent orchestration · Tool use · Guardrails · RAG · Fine-tuning · LLM observability
NVIDIA	Director, Perception - Autonomous Vehicles	Semiconductors	9	Vision · Multimodal · Model serving · Inference infra · Fine-tuning · Synthetic data
OpenAI	Research Engineer/Scientist - Human Alignment, Consumer Devices	AI Frontier	9	RL post-training · Reward modeling · Multimodal
OpenAI	Security Researcher, Codex Security	AI Frontier	9	Agent orchestration · Fine-tuning · Model serving