2097 AI roles tagged evals.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| LangChain | Product Manager, LangSmith | Data AI | 8 | LLM observability · Agent orchestration · Model serving |
| Datadog | Senior AI Engineer - APM Experiences | Enterprise | 8 | Agent orchestration · Tool use · Guardrails · LLM observability · RAG · Model serving |
| Microsoft | Principal Applied Scientist | Big Tech | 8 | Agent orchestration · RL post-training · Fine-tuning · Model serving · LLM observability |
| Microsoft | Research Intern - STAC, NYC (Sociotechnical Alignment Center) | Big Tech | 8 | LLM observability · Synthetic data · Interpretability |
| Intercom | Staff AI Product Manager | Enterprise | 8 | Agent orchestration · Model serving |
| Scale AI | Senior Machine Learning Engineer - Model Evaluations, Public Sector | Data AI | 8 | LLM observability · Agent orchestration · Guardrails · Multimodal |
| Snorkel AI | Applied AI Engineer - AI Solutions | Data AI | 8 | Agent orchestration · RAG · Fine-tuning · Vector DB · LLM observability |
| OpenAI | Forward Deployed Engineer - London | AI Frontier | 8 | Model serving · Inference infra · Agent orchestration · LLM observability |
| Anthropic | Forward Deployed Engineer, Applied AI | AI Frontier | 8 | Agent orchestration · Tool use · Fine-tuning · Model serving |
| Nuro | Technical Lead Manager, Autonomy Evaluation and Intelligence | Robotics | 8 | Agent research · Embodied AI · Agent orchestration · Model serving |
| Stripe | Machine Learning Engineer, Supportability | Fintech | 8 | Agent orchestration · LLM observability · Model serving |
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Model serving · Inference infra · Guardrails · Vector DB · RAG · LLM observability · Fine-tuning · Agent orchestration |
| Capital One | Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Model serving · Inference infra · Fine-tuning · Guardrails · Vector DB · RAG · Agent orchestration · LLM observability |
| Sierra | Software Engineer, Agent | AI Frontier | 8 | Agent orchestration · RAG · Model serving · LLM observability |
| Zillow | AI Applied Scientist - PhD Intern, Evaluation Systems and Metrics | Consumer | 8 | Multimodal · Agent research · Guardrails |
| Amazon | Applied Scientist II, Strategic Account Services (SAS) | Big Tech | 8 | Model serving |
| Amazon | Principal Applied Scientist, Advertiser Growth, Amazon Sponsored Products & Brands | Big Tech | 8 | Agent orchestration · Fine-tuning · RL post-training · Recommender systems |
| Klaviyo | Senior AI Engineer | Enterprise | 8 | Agent orchestration · Fine-tuning · Model serving · Inference infra |
| Scale AI | STEM Fellow - Human Frontier Collective (UK) | Data AI | 8 | Frontier research |
| Descript | Senior Software Engineer, Agent | AI Frontier | 8 | Agent orchestration · Tool use · LLM observability · Multimodal |
| ZoomInfo | Senior Product Manager, Context Engineering | Enterprise | 8 | RAG · Vector DB · Agent orchestration · LLM observability · Model serving |
| Apple | AIML - Research Scientist, AI Interpretability & Visualization | Big Tech | 8 | Interpretability |
| Intercom | Senior Data Scientist - AI Tooling | Enterprise | 8 | Agent orchestration · Tool use · RAG · Vector DB |
| Abridge | Software Engineer, Gen AI Platform | Vertical AI | 8 | Agent orchestration · Tool use · LLM observability · RAG · Vector DB |
| Uber | Sr. Staff Engineer (Conversational/Voice AI) | Consumer | 8 | Agent orchestration · Tool use · Guardrails · LLM observability · RAG · Audio & speech · Model serving · Multimodal |
| Apple | AI Data Scientist | Big Tech | 8 | LLM observability · RAG · Fine-tuning · Multimodal |
| Scale AI | AI Product Manager | Data AI | 8 | Agent orchestration · RL robotics · Embodied AI · Synthetic data |
| Software Engineer III, AI/ML GenAI, Google Cloud AI | Big Tech | 8 | Model serving · Inference infra · Fine-tuning · Multimodal · Vision · Audio & speech · Code gen | |
| OpenAI | Forward Deployed Engineer - Tokyo | AI Frontier | 8 | Model serving · LLM observability |
| LangChain | Fullstack Software Engineer, Applied AI | Data AI | 8 | Agent orchestration · RAG · LLM observability · Model serving |