2097 AI roles tagged evals.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Decagon | Senior Data Scientist | Vertical AI | 7 | Recommender systems |
| Amazon | Principal, PMT, Selling Partner Identity Verification | Big Tech | 7 | Agent orchestration · Guardrails |
| Salesforce | AFD360 Solution Engineer - Regulated Industries | Enterprise | 7 | Agent orchestration · RAG · Guardrails · LLM observability |
| Meta | Product Manager, Central Product | Big Tech | 7 | Agent orchestration |
| Notion | Application Security Engineer, AI Security | Enterprise | 7 | Agent orchestration · Guardrails |
| Apple | Data Scientist | Big Tech | 7 | Agent orchestration · RAG · LLM observability · Model serving |
| Intercom | Senior Data Scientist AI Tooling | Enterprise | 7 | Agent orchestration · Tool use · RAG · Vector DB |
| Netflix | Creative Evaluation Lead, Artwork | Big Tech | 7 | Fine-tuning · Model serving · Vision · Multimodal |
| 1Password | Senior Security Researcher | Enterprise | 7 | Agent orchestration · Agent research · Guardrails |
| 1Password | Staff Security Researcher | Enterprise | 7 | Agent research · Agent orchestration · Guardrails |
| 1Password | Principal Security Researcher | Enterprise | 7 | Agent research · Agent orchestration · Guardrails |
| NVIDIA | AI Benchmarking and Telemetry Engineer - NVIS | Semiconductors | 7 | Inference infra · Model serving · LLM observability |
| F5 | Solutions Engineer — AI & Data Science Specialist | Enterprise | 7 | LLM observability · Guardrails |
| Microsoft | Senior Data Scientist | Big Tech | 7 | Fine-tuning · Multimodal |
| Toast | Senior Analyst, Model Risk Management | Enterprise | 7 | Guardrails · LLM observability · RAG · Fine-tuning |
| Netflix | Director of Product, Catalog and Content Understanding, Content Platform Operations & Publishing | Big Tech | 7 | Vision |
| Rubrik | Application Security Engineer | Enterprise | 7 | Agent orchestration · Agent research · Guardrails · Fine-tuning · Model serving |
| Walmart | Staff, Software Engineer | Retail | 7 | Agent orchestration · LLM observability |
| Glean | AI Outcomes Manager, West | Enterprise | 7 | Agent orchestration · Tool use |
| Spotify | Content Designer, Personalization | Consumer | 7 | Agent orchestration · LLM observability · Guardrails |
| JPMorgan Chase | Lead Software Engineer - Full Stack AI/ML | Banking | 7 | RAG · Model serving |
| Apple | ML Engineering Manager, Gen AI Frameworks Team | Big Tech | 7 | Model serving · Inference infra |
| Descript | Engineering Manager, Narrative Editing | AI Frontier | 7 | Multimodal · Model serving |
| Anthropic | Manager of Applied AI Architecture, Industries | AI Frontier | 7 | LLM observability · Model serving |
| OpenAI | Model Policy Manager, Chemical & Biological Risk | AI Frontier | 7 | Guardrails · RL post-training · Multimodal |
| Harvey | Engineering Manager, Product Engineering | AI Frontier | 7 | Agent orchestration · RAG · Model serving |
| Sierra | Agent Experience Designer, Voice (Multilingual) | AI Frontier | 7 | Audio & speech |
| Microsoft | Member of Technical Staff - Responsible AI (CoreAI) | Big Tech | 7 | Multimodal · Fine-tuning · Model serving |
| JPMorgan Chase | Data Scientist Lead | Banking | 7 | RAG · LLM observability · Search & ranking |
| Decagon | Director of Customer Engineering, Agent Builder | Vertical AI | 7 | Agent orchestration · Guardrails · LLM observability |