2097 AI roles tagged evals.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Apple | Machine Learning Engineer - Ads Relevance & Quality | Big Tech | 7 | Recommender systems · Search & ranking · Fine-tuning · Multimodal |
| Toast | Senior Product Manager, Support AI Experimentation | Enterprise | 7 | Agent research · LLM observability · RAG · Vector DB · Fine-tuning · Model serving · Audio & speech |
| Cresta | AI Quality Assurance Intern | Vertical AI | 7 | LLM observability · Guardrails |
| Wiz | AI Security Researcher | Enterprise | 7 | Guardrails |
| LangChain | Education Engineer, Machine Learning | Data AI | 7 | LLM observability · Agent orchestration · Fine-tuning |
| Ramp | Product Manager | Generalist (All Levels) | Fintech | 7 | Fine-tuning · RAG |
| Apple | Applications of ML Engineering Manager | Big Tech | 7 | Guardrails · LLM observability · Agent orchestration · Fine-tuning · Multimodal |
| Uber | Staff Machine Learning Engineer - Ads | Consumer | 7 | Recommender systems · Search & ranking · Model serving · Inference infra |
| Iterable | Senior Machine Learning Engineer (Nova) | Enterprise | 7 | Agent orchestration · RAG · Vector DB · Model serving |
| Manager II, Engineering | Consumer | 7 | Agent orchestration · Guardrails · Recommender systems | |
| Microsoft | PostDoc Researcher-FATE (Fairness, Accountability, Transparency, and Ethics in AI-Microsoft Research | Big Tech | 7 | Guardrails · Interpretability |
| Handshake | Manager, Strategic Projects | Enterprise | 7 | |
| Sierra | Strategist, Agent Development | AI Frontier | 7 | Agent orchestration · Tool use · RAG |
| Wix | Product Manager - AI assistant | Enterprise | 7 | Agent orchestration · LLM observability |
| Stripe | Software Engineer, Fee Insights | Fintech | 7 | Agent orchestration |
| Apple | AIML - Data Scientist, Evaluation | Big Tech | 7 | LLM observability · Fine-tuning · Guardrails |
| OpenAI | Product Manager, Integrity | AI Frontier | 7 | Agent orchestration · Guardrails · LLM observability |
| Outreach | Senior Product Manager, AI | Enterprise | 7 | Agent orchestration · LLM observability · Guardrails |
| Sierra | Software Engineer, Insights | AI Frontier | 7 | Agent orchestration · LLM observability |
| Whatnot | Software Engineer, Trust & Risk | Consumer | 7 | Agent orchestration · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Inference infra · Model serving · Recommender systems · Search & ranking · Vision · Audio & speech · Frontier research · Interpretability · Synthetic data · Agent research · RL post-training · RLHF · Reward modeling · RL robotics · Embodied AI |
| Airbnb | Engineering Manager, Quality Platform | Consumer | 7 | LLM observability · Agent orchestration · Model serving |
| Microsoft | Research Intern - MSR Inclusive Futures Team | Big Tech | 7 | Interpretability |
| Microsoft | Research Intern - Technology for Religious Empowerment | Big Tech | 7 | Agent orchestration |
| Amazon | Senior Language Engineer, Artificial General Intelligence - Data Services | Big Tech | 7 | Synthetic data · Multimodal · Fine-tuning |
| Roblox | Principal Data Scientist - Safety | Consumer | 7 | |
| Figure AI | Safety Systems Architect | Robotics | 7 | Embodied AI · Agent orchestration |
| Databricks | Staff Software Engineer, Search Quality | Data AI | 7 | Search & ranking · Recommender systems · RAG · Vector DB · Multimodal |
| Uber | Senior Program Manager, Tech - Uber AI Solutions | Consumer | 7 | Multimodal |
| Anthropic | Applied AI Architect, Industries | AI Frontier | 7 | LLM observability · Model serving · RAG |
| Apple | Staff Software Engineer (Applied AI) | Big Tech | 7 | Fine-tuning · RAG · Model serving |