1508 AI roles tagged evals.

| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Zillow | AI Applied Scientist - PhD Intern, Generative Computer Vision | Consumer | 9 | Vision · Multimodal · Fine-tuning |
| Anthropic | Research Engineer, Virtual Collaborator (Cowork) | AI Frontier | 9 | RL post-training · Reward modeling · Synthetic data |
| xAI | Member of Technical Staff - Mid-training | AI Frontier | 9 | Synthetic data · Multimodal · RL post-training |
| Scale AI | Machine Learning Research Engineer, Agent Data Foundation - Enterprise GenAI | Data AI | 9 | Synthetic data · RL post-training · Agent orchestration · Fine-tuning |
| Scale AI | Engineering Manager, AgentOps | Data AI | 9 | Agent orchestration · Agent research · Guardrails · RL post-training |
| OpenAI | Researcher, Pretraining Safety | AI Frontier | 9 | Pretraining · Frontier research · Model serving |
| Fireworks AI | Member of Technical Staff, Evals & Post-Training Product | Data AI | 9 | Fine-tuning |
| Zillow | AI Applied Scientist - PhD Intern, Foundational IQ | Consumer | 9 | Fine-tuning · Multimodal · Agent orchestration |
| Zillow | AI Applied Scientist - PhD Intern, 3D Computer Vision | Consumer | 9 | Vision · Multimodal · Fine-tuning |
| OpenAI | Offensive Security Engineer, Agent Products | AI Frontier | 9 | Agent orchestration · Tool use · Guardrails · Model serving · Inference infra |
| Gusto | Sr. Staff AI/ML Engineer | Fintech | 9 | Agent orchestration · RAG · Model serving · LLM observability · Guardrails |
| Cohere | Senior Research Scientist, Model Evaluation | AI Frontier | 9 | LLM observability · Fine-tuning |
| Wayve | Machine Learning Engineer | Robotics | 9 | Embodied AI · Model serving · Inference infra · Synthetic data · Fine-tuning |
| Anthropic | ML/Research Engineer, Safeguards | AI Frontier | 9 | Agent orchestration · Guardrails · Synthetic data · Agent research |
| Anthropic | Research Operations & Strategy Lead - Coding & Cybersecurity Data | AI Frontier | 9 | Agent research · Agent orchestration · Fine-tuning · RL post-training |
| Anthropic | Data Operations Manager - Computer Use & Tool Use | AI Frontier | 9 | Agent orchestration · RL post-training · Tool use · Agent research |
| Anthropic | Privacy Research Engineer, Safeguards | AI Frontier | 9 | Fine-tuning · RL post-training · Interpretability |
| Character AI | Research Engineer, AI Safety & Alignment | AI Frontier | 9 | Interpretability · RL post-training · Fine-tuning · Guardrails · LLM observability |
| OpenAI | Technical Lead, Safety Research | AI Frontier | 9 | RL post-training · Guardrails · Frontier research · Interpretability |
| OpenAI | Data Scientist, Codex | AI Frontier | 9 | Agent orchestration · Code gen |
| Anthropic | Research Engineer, Pretraining Scaling - London | AI Frontier | 9 | Pretraining · Model serving · Inference infra · LLM observability |
| Anthropic | Research Engineer / Research Scientist, Biology & Life Sciences | AI Frontier | 9 | Fine-tuning · RL post-training · Frontier research · Agent research |
| Anthropic | Research Engineer / Scientist, Tool Use Safety | AI Frontier | 9 | Agent orchestration · Tool use · Guardrails · RL post-training · Agent research · Fine-tuning · LLM observability |
| Sierra | Software Engineer, Agent (New Grad) | AI Frontier | 9 | Agent orchestration · RAG · Model serving · LLM observability |
| OpenAI | Forward Deployed Engineer - Munich | AI Frontier | 9 | Agent orchestration · Model serving · Inference infra · LLM observability |
| OpenAI | Forward Deployed Engineer - Paris | AI Frontier | 9 | Model serving · Inference infra · LLM observability · Agent orchestration |
| OpenAI | Forward Deployed Engineer - Dublin | AI Frontier | 9 | Agent orchestration · Model serving · Inference infra · LLM observability |
| Perplexity | Member of Technical Staff (Software Engineer, Applied AI) | AI Frontier | 9 | Agent orchestration · Recommender systems · Search & ranking · Fine-tuning · LLM observability |
| Shield AI | Product Manager, AI Platforms (R4991) | Defense | 9 | Multimodal · Training infra · Synthetic data · Inference infra · Model serving |
| Scale AI | Machine Learning Research Scientist, Reasoning | Data AI | 9 | Agent orchestration · Agent research · Fine-tuning · LLM observability |