2097 AI roles tagged evals.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Fine-tuning · Inference infra · Model serving · Guardrails · LLM observability · RAG · Vector DB |
| Anthropic | Applied AI Engineer, Enterprise Tech | AI Frontier | 8 | Agent orchestration · Fine-tuning · Model serving · RAG |
| Apple | Computer Vision and Machine Learning Engineer, Creativity Apps | Big Tech | 8 | Vision · Fine-tuning · Model serving |
| xAI | Model Behavior Tutor - Epistemic Rigor & Truthfulness | AI Frontier | 8 | Guardrails · Interpretability · Agent research |
| Anthropic | Applied AI Architect, Startups | AI Frontier | 8 | Agent orchestration · Model serving · RAG · Vector DB · Fine-tuning |
| Snorkel AI | Senior Applied AI Engineer - Dubai | Data AI | 8 | RAG · Fine-tuning · Agent orchestration · Synthetic data |
| Sierra | Software Engineer, Agent Builder | AI Frontier | 8 | Agent orchestration · Agent research |
| OpenAI | Backend Software Engineer (Evals) | AI Frontier | 8 | LLM observability · Agent orchestration · Tool use |
| Microsoft | Applied Scientist II / Senior Applied Scientist - Responsible AI (CoreAI) | Big Tech | 8 | Fine-tuning · RLHF · Guardrails · Agent orchestration · Agent research |
| Weights & Biases | Principal Engineer - Perf and Benchmarking | Data AI | 8 | Inference infra · Model serving · Training infra · LLM observability · Vision · Audio & speech |
| Apollo.io | Product Builder (Product Manager), AI Agents | Enterprise | 8 | Agent orchestration · Agent research · RAG · LLM observability |
| Netflix | Research Scientist (L5) - Content Understanding | Big Tech | 8 | Multimodal · Fine-tuning · LLM observability |
| Capital One | Lead AI Engineer | Banking | 8 | Model serving · Inference infra · Guardrails · Vector DB · RAG · Fine-tuning · LLM observability |
| Microsoft | Member of Technical Staff, Applied Scientist | Big Tech | 8 | Agent orchestration · Fine-tuning |
| Cohere | Member of Technical Staff, Data Analysis and Evaluation | AI Frontier | 8 | Fine-tuning · Pretraining |
| Anthropic | Applied AI Architect, Startups | AI Frontier | 8 | Agent orchestration · LLM observability · RAG · Vector DB · Model serving |
| Intercom | Staff AI Product Manager | Enterprise | 8 | Agent orchestration · Model serving |
| Abridge | Director, Product Management - AI/ML, Core Product | Vertical AI | 8 | LLM observability · Fine-tuning · Model serving |
| Microsoft | Research Intern - Microsoft Teams (PhD) | Big Tech | 8 | Agent research · Multi-agent · RL post-training · Fine-tuning · Synthetic data · Guardrails |
| Databricks | AI Engineer - FDE (Forward Deployed Engineer) | Data AI | 8 | RAG · Agent orchestration · Fine-tuning · Model serving |
| Databricks | AI Engineer - FDE (Forward Deployed Engineer) | Data AI | 8 | RAG · Agent orchestration · Fine-tuning · Model serving |
| Microsoft | Member of Technical Staff - Post Training - MAI Superintelligence Team | Big Tech | 8 | RL post-training · Reward modeling · Fine-tuning |
| Sierra | Software Engineer, Agent (Arabic speaking) | AI Frontier | 8 | Agent orchestration · Model serving · RAG · Agent research |
| Scale AI | Senior Forward Deployed Data Scientist/Engineer | Data AI | 8 | Model serving · LLM observability |
| Datadog | Senior Software Engineer (Agent Engineer) - AI Code Gen | Enterprise | 8 | Agent orchestration · Code gen · LLM observability |
| Capital One | Senior Manager, Data Scientist - US Card (Generative AI Systems) | Banking | 8 | Vision · Multimodal · Model serving · Fine-tuning |
| Glean | Machine Learning Engineer, Enterprise Brain | Enterprise | 8 | Agent orchestration · Agent research · Fine-tuning · RL post-training · LLM observability · Recommender systems · Search & ranking |
| Anthropic | Research Scientist, Life Sciences | AI Frontier | 8 | Fine-tuning |
| Snorkel AI | Applied AI Engineer - Federal (TS Required) | Data AI | 8 | Agent orchestration · RAG · Fine-tuning · Vector DB · Synthetic data · Model serving |
| Roblox | Senior Data Scientist - Generative AI | Consumer | 8 | LLM observability · Fine-tuning · Agent orchestration · Multimodal · Code gen |