1,508 AI roles tagged "evals".

| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Grafana Labs | Staff AI Engineer \| US \| Remote | Data AI | 9 | Agent orchestration · Tool use · RAG · LLM observability · Guardrails |
| Snowflake | Staff Research Scientist, AI Agents & LLMs | Data AI | 9 | Agent orchestration · Agent research · Fine-tuning · Model serving · Inference infra |
| OpenAI | Applied AI Engineer, Codex Core Agent | AI Frontier | 9 | Agent orchestration · Tool use · Fine-tuning · LLM observability · Code gen |
| Intercom | Principal Engineer, Fin AI Agent | Enterprise | 9 | Agent orchestration · LLM observability · Model serving |
| Scale AI | Research Scientist, Frontier Risk Evaluations | Data AI | 9 | Agent orchestration · Guardrails · Frontier research · LLM observability |
| Adobe | Principal Architect, Express AI Foundations | Enterprise | 9 | Agent orchestration · Model serving · Inference infra · LLM observability · Multimodal |
| NVIDIA | Senior AI ML Solution Engineer, AI-Native Development | Semiconductors | 9 | Agent orchestration · Tool use · Fine-tuning · RAG · Code gen |
| Zillow | Principal Machine Learning Engineer, Agentic AI | Consumer | 9 | Agent orchestration · Multimodal · Guardrails · LLM observability · Model serving · Agent research |
| OpenAI | AI Deployment Engineer, Startups | AI Frontier | 9 | Agent orchestration · Model serving · Fine-tuning · LLM observability |
| Snowflake | AI Engineer - Cortex Code Quality | Data AI | 9 | Agent orchestration · Tool use · Guardrails · LLM observability · RAG · Code gen |
| Cohere | Member of Technical Staff, Safety for Agents | AI Frontier | 9 | RL post-training · Agent orchestration · Fine-tuning · LLM observability |
| Zillow | Principal Applied Scientist, Agentic AI | Consumer | 9 | RL post-training · RLHF · Reward modeling · Fine-tuning · Guardrails · Agent orchestration · Multimodal · Vector DB |
| Perplexity | Member of Technical Staff (Secure Intelligence Institute) | AI Frontier | 9 | Agent orchestration · Guardrails · Agent research |
| Cohere | Research Internship (Spring/Summer 2026) | AI Frontier | 9 | Frontier research · Pretraining · Fine-tuning · Multimodal · LLM observability |
| Scale AI | Research Scientist, Agent Robustness | Data AI | 9 | Agent orchestration · Agent research · Guardrails · RL post-training · Fine-tuning |
| Scale AI | Research Scientist, AI Controls and Monitoring | Data AI | 9 | LLM observability · Guardrails · Interpretability · RL post-training · Agent research |
| Canva | Staff Machine Learning Engineer - Integrations & Solutions Group (AU remote) | Enterprise | 9 | Agent orchestration · Tool use · LLM observability |
| Cohere | Product Manager, Agent Harness & Modelling | AI Frontier | 9 | Agent orchestration · Tool use · RAG · Agent research · Fine-tuning |
| Wayve | Principal Machine Learning Engineer, App SW | Robotics | 9 | Embodied AI · Model serving · Inference infra · Synthetic data · Fine-tuning |
| Baseten | Post-Training Applied Researcher | Data AI | 9 | Fine-tuning · RL post-training · Reward modeling · Agent orchestration · Tool use · Synthetic data · Model serving |
| Wayve | Machine Learning Engineer, AV Engineering | Robotics | 9 | Embodied AI · Fine-tuning · Synthetic data |
| Cresta | Senior Machine Learning Engineer - Voice Experience | Vertical AI | 9 | Audio & speech · Fine-tuning · Model serving · Inference infra · RAG · Agent orchestration · LLM observability |
| OpenAI | Machine Learning Engineer, Integrity | AI Frontier | 9 | Fine-tuning · Model serving · LLM observability · Guardrails |
| Walmart | Principal, Data Scientist | Retail | 9 | Agent orchestration · Tool use · RAG · Vector DB · Model serving · Inference infra |
| xAI | Member of Technical Staff - Voice Model | AI Frontier | 9 | Audio & speech · Fine-tuning · RL post-training · Inference infra · Model serving |
| Zillow | Senior Applied Scientist, Agentic AI | Consumer | 9 | Agent orchestration · Tool use · Fine-tuning · LLM observability · Agent research |
| Ramp | Agentic Operator, Growth Marketing | Fintech | 9 | Agent orchestration · Tool use · Guardrails · RAG · Fine-tuning · LLM observability |
| NVIDIA | Director, Perception - Autonomous Vehicles | Semiconductors | 9 | Vision · Multimodal · Model serving · Inference infra · Fine-tuning · Synthetic data |
| OpenAI | Research Engineer/Scientist - Human Alignment, Consumer Devices | AI Frontier | 9 | RL post-training · Reward modeling · Multimodal |
| OpenAI | Security Researcher, Codex Security | AI Frontier | 9 | Agent orchestration · Fine-tuning · Model serving |