3035 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| JPMorgan Chase | Applied AIML -Executive Director | Banking | 8 | Recommender systems · Search & ranking · RAG · Model serving · Evals · Guardrails |
| ElevenLabs | Safety Engineer | AI Frontier | 8 | Guardrails · Agent orchestration · Model serving · LLM observability · Multimodal |
| JPMorgan Chase | Applied AIML Associate- Python & Data Science Engineering | Banking | 8 | RAG · Vector DB · Model serving |
| Cohere | Lead Member of Technical Staff, Inference Infrastructure | AI Frontier | 8 | Model serving |
| Research Engineer, Deployment Performance, Robotics, DeepMind | Big Tech | 8 | Model serving | |
| Staff Software Engineer, Content Safety, Infrastructure | Big Tech | 8 | Agent orchestration · Multimodal · Model serving · Guardrails · LLM observability · Evals | |
| Senior Staff Software Engineer, Infrastructure, Agents Infra | Big Tech | 8 | Agent orchestration · Model serving · Vision · Multimodal | |
| Amazon | Senior Software Development Engineer - AI Mftg & Automation, Advanced Manufacturing Engineering (AME) | Big Tech | 8 | Agent orchestration · Vision · Multimodal · Model serving |
| Johnson & Johnson | Senior Scientist, Computer Vision | Pharma | 8 | Vision · Multimodal · Model serving |
| NVIDIA | Senior Software Engineer - VLM Microservices for Neural Reconstruction | Semiconductors | 8 | Vision · Multimodal · Model serving · Fine-tuning |
| NVIDIA | AI Computing Software Development Engineer, TensorRT | Semiconductors | 8 | Model serving |
| Salesforce | Software Engineer (Multiple Levels) - Machine Learning Infrastructure, Slack | Enterprise | 8 | Model serving · Training infra · LLM observability |
| NVIDIA | Senior Solutions Architect, Generative AI | Semiconductors | 8 | Model serving · Recommender systems |
| Snowflake | Staff Software Engineer, Cortex AI Infrastructure | Data AI | 8 | Agent orchestration · RAG · Vector DB · Evals · Guardrails · LLM observability · Model serving |
| Uber | Senior Software Engineer - AV Labs | Consumer | 8 | Embodied AI |
| Mistral AI | Applied AI, Senior/Staff Forward Deployed Machine Learning Engineer - Morocco | AI Frontier | 8 | Fine-tuning · RAG · Vector DB · Agent orchestration · LLM observability · Model serving |
| Axon | Sr. Full Stack Member of Technical Staff | Enterprise | 8 | Multimodal · Agent orchestration · Model serving |
| Software Engineering Manager, Cloud ML Compute Services (Mandarin, English) | Big Tech | 8 | Model serving · Fine-tuning · Evals | |
| NVIDIA | Principal Cloud Services Software Engineer | Semiconductors | 8 | Model serving |
| NVIDIA | Principal AI and ML Infra Software Engineer, GPU Clusters | Semiconductors | 8 | Model serving |
| Comcast | Machine Learning Engineer 4 | Media | 8 | Agent orchestration · LLM observability · Evals · Guardrails · Model serving |
| Salesforce | Software Engineering PMTS | Enterprise | 8 | Agent orchestration · LLM observability · RAG · Vector DB · Fine-tuning · Model serving |
| Sigma Computing | Staff AI/ML Engineer | Data AI | 8 | Agent orchestration · Tool use · Model serving · Fine-tuning · Multimodal |
| Expedia | Senior Software Development Engineer (GenAI, Agentic AI) | Hospitality | 8 | Agent orchestration · Tool use · RAG · Vector DB · Fine-tuning · Model serving · LLM observability · Guardrails |
| NVIDIA | Compiler Engineer - AI Inference | Semiconductors | 8 | Model serving |
| Verkada | AI Software Engineering Intern - Fall 2026 | Enterprise | 8 | Multimodal · Vision · Audio & speech · Model serving |
| NVIDIA | Senior Software Engineer, Metropolis Vision AI | Semiconductors | 8 | Vision · Model serving · Multimodal · Synthetic data |
| NVIDIA | Senior Software Engineer, AI Networking | Semiconductors | 8 | Model serving · Agent orchestration · Agent research |
| Capital One | Manager, Data Science - GenAI Digital Assistant | Banking | 8 | Agent orchestration · Fine-tuning · Model serving · RAG · Vector DB · LLM observability |
| Baseten | Software Engineer - Voice AI (Inference Runtime) | Data AI | 8 | Model serving · Audio & speech · Agent orchestration · Tool use |