2097 AI roles tagged evals.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Uber | Senior Product Manager, AV Labs | Consumer | 7 | |
| Uber | Group Product Manager, AV Labs | Consumer | 7 | |
| xAI | Senior Analyst, Safety Operations | AI Frontier | 7 | Fine-tuning · RL post-training · Guardrails · LLM observability |
| Aurora Innovation | System Test Engineer II, Autonomy Behavior | Robotics | 7 | |
| | Staff Software Engineer, Agent Foundations | Consumer | 7 | Agent orchestration · Tool use · Model serving · Inference infra · LLM observability |
| DoorDash | Senior Software Engineer, Motion Planning – DoorDash Labs | Consumer | 7 | Agent orchestration · Model serving |
| Uber | Engineering Manager II, Evaluation & Simulation - AV Labs | Consumer | 7 | |
| Warner Bros Discovery | Senior Data Scientist - Video AI team, Bangalore | Media | 7 | Model serving · Inference infra · RAG · Vector DB · Fine-tuning · Vision · Multimodal |
| Vanta | Head of EPD Systems and AI Transformation | Enterprise | 7 | Agent orchestration · Tool use · Guardrails · LLM observability |
| Microsoft | Senior Consultant - Full Stack Apps A2 | Big Tech | 7 | RAG · Agent orchestration · LLM observability |
| Sierra | Software Engineer, Agent Architecture | AI Frontier | 7 | Agent orchestration · RAG · LLM observability |
| Amazon | Sr. Product Manager (Tech), Trustworthy Shopping Experience | Big Tech | 7 | LLM observability · Guardrails · Model serving |
| Eli Lilly | Principal Engineer - Quality Engineering | Pharma | 7 | LLM observability · RAG · Fine-tuning · Model serving · Audio & speech |
| Eli Lilly | Technical Lead Software Architect | Pharma | 7 | Agent orchestration · Agent research · RAG · Model serving · Inference infra |
| Adobe | Tech Lead / Architect | Enterprise | 7 | Agent orchestration · Agent research · LLM observability · Guardrails |
| Toast | GTM Engineer, Marketing Operations AI Innovation | Enterprise | 7 | Agent orchestration · Tool use · Fine-tuning · RAG |
| Microsoft | Principal Software Engineer - CoreAI | Big Tech | 7 | Agent orchestration · Agent research · Code gen |
| Plaid | Senior Data Scientist - Data Foundations & AI | Fintech | 7 | LLM observability |
| xAI | Manager, Safety Operations | AI Frontier | 7 | RL post-training · Guardrails · LLM observability |
| xAI | Senior Analyst, Legal Operations | AI Frontier | 7 | Agent orchestration · Guardrails · Fine-tuning · Synthetic data |
| Roblox | Senior Product Manager, AI Content & Communications Safety | Consumer | 7 | Guardrails · Multimodal |
| Microsoft | Director, MAVS Technology & Product | Big Tech | 7 | Agent orchestration · Guardrails |
| | Staff Software Engineer, Applied AI, Search | Big Tech | 7 | Agent orchestration · Search & ranking · Recommender systems · Audio & speech · Fine-tuning · Model serving |
| Adobe | Principal Product Manager, Research and AI | Enterprise | 7 | Multimodal · Fine-tuning · Model serving |
| Wayve | Robotaxi Technical Operations | Robotics | 7 | Embodied AI · Agent orchestration |
| Grafana Labs | Staff AI Engineer \| US \| Remote | Data AI | 7 | Agent orchestration · LLM observability · Guardrails |
| Grafana Labs | Staff AI Engineer \| Canada \| Remote | Data AI | 7 | Agent orchestration · RAG · LLM observability · Guardrails |
| Bank of America | Experience Design (XD) Principal, Content Design - Bank of America Experience Design | Banking | 7 | Agent orchestration · RAG · Guardrails · LLM observability |
| Adobe | Principal Strategic Program Manager | Enterprise | 7 | Model serving · Inference infra · Guardrails · LLM observability |
| Pinecone | Senior/Staff Software Engineer, Search & Retrieval Infrastructure | Data AI | 7 | Vector DB · RAG · Agent orchestration · Inference infra · Model serving · LLM observability |