AI Frontier · Enterprise LLMs
Cohere has 77 active AI-related job listings. The majority of these roles are focused on agents, representing 39% of the total. Engineering is the dominant function, with 60 positions. The company is actively hiring for roles related to model serving, agent orchestration, and fine-tuning. In the last 30 days, Cohere has posted 17 new AI roles, a significant increase compared to the previous 30-day period.
Currently tracking 69 active AI roles, down 22% versus the prior 4 weeks. Primary focus: Agent · Engineering.
Cohere currently has 78 active AI-related roles in our index. The most common open titles are: Forward Deployed Engineer, Agentic Platform (2), Solutions Architect - Public Sector (2), Applied AI Engineer - Agentic Workflows (Singapore), Applied AI Engineer – Agentic Workflows, Applied AI Engineer – Agentic Workflows (Korea). Most positions are in Engineering and Research.
Cohere's active AI hiring is concentrated in: agents (36%), data (19%), serving infrastructure (18%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
Cohere is hiring AI talent in: Canada (34 roles), United States (16 roles), United Kingdom (14 roles), France (3 roles).
Job postings at Cohere most frequently reference: model serving, agent orchestration, fine tuning, inference infra, rag.
In the past 30 days, Cohere has posted 9 new AI-related roles. That is a -40% change versus the prior 30 days (15 → 9).
| Title | Stage | AI score |
|---|---|---|
| Product Manager, Safety Research Product Manager for Safety Research at Cohere, focusing on bridging AI safety research with the North agentic AI platform. The role involves translating research findings into product-level safety features, defining the safety product roadmap, partnering with modeling teams on evaluations, and coordinating the development of guardrails and intervention mechanisms. Requires technical depth to engage with researchers and product instincts for translating insights into actionable product strategies. | ShipEval Gate | 9 |
| Member of Technical Staff, Senior/Staff MLE Cohere is seeking a Senior/Staff Member of Technical Staff, Applied ML to work directly with enterprise customers on problems that push LLMs to their limits. This role involves designing custom LLM solutions, delivering production-ready models, and training/customizing frontier models using Cohere's full stack. The position also influences Cohere's foundation models and requires operating with early-startup level ownership. Responsibilities include technical leadership, solution design, modeling, customization, customer-facing impact, and team mentorship. |
| Post-trainAgent |
| 9 |
| Member of Technical Staff, MLE This role focuses on applying and customizing Cohere's frontier LLMs for enterprise customers, involving post-training, retrieval, and agent integrations. The individual will design and deliver production-ready models, influence the development of foundation models, and operate with significant ownership, combining application, research, and customer-facing engineering. | Post-trainAgent | 9 |
| Applied AI Engineer – Agentic Workflows Cohere is seeking an Applied AI Engineer to build production-grade AI agents for enterprise customers. This role involves designing, building, and deploying agentic workflows powered by LLMs, integrating them with tools, APIs, and data sources. The engineer will focus on reliability, observability, safety, and audibility, working closely with customers and shaping how agentic systems are built and deployed. | Agent | 9 |
| Staff Research Engineer, Model Efficiency Cohere is seeking a Staff Research Engineer focused on Model Efficiency to push the limits of LLM inference efficiency. This role involves exploring and shipping breakthroughs in model architecture, routing optimization, decoding algorithms, software/hardware co-design for GPU acceleration, and performance optimization without compromising model quality. The goal is to improve how fast and efficiently their foundation models run in production. | ServePretrain | 9 |
| Member of Technical Staff, Model Efficiency Cohere is seeking an engineer to improve LLM inference efficiency by optimizing model execution, reducing latency and increasing throughput. This role involves deep dives into model execution, identifying bottlenecks, and developing optimizations across the inference stack, including GPU/CUDA and kernel-level improvements. | Serve | 9 |
| Member of Technical Staff, Agents Modeling Cohere is seeking an experienced ML researcher/engineer to push the frontiers of agentic LLM systems. This role involves exploring and developing agentic techniques, building models for agentic solutions, and working on strategies for training models for advanced agent capabilities like reasoning, tool use, and memory. The role also includes developing data-generation techniques for post-training (SFT and RL*), with direct impacts on Cohere's products. | AgentPost-train | 9 |
| Member of Technical Staff, Search Cohere is seeking a Member of Technical Staff for their Search team to develop and improve state-of-the-art models for information retrieval, focusing on training embedding and reranker models. The role involves gathering and optimizing retrieval datasets, collaborating with serving and product teams, and engaging in research collaborations with a focus on publishing work. | DataPost-train | 9 |
| Senior Member of Technical Staff, Multimodal AI Cohere is seeking a Senior Member of Technical Staff to focus on Multimodal AI. This role involves designing and developing cutting-edge multimodal AI systems integrating text, speech, and vision. The candidate will conduct research and experiments on advanced compute infrastructure, exploring novel ideas in multimodal representation learning and transfer learning. The role requires strong software engineering skills, proficiency in Python and deep learning frameworks (JAX, PyTorch, TensorFlow), and knowledge of distributed training strategies for large-scale multimodal models. Experience with autoregressive models for tasks like image/video captioning and speech-to-text is beneficial. The ideal candidate enjoys tuning and optimizing large multimodal models and building evaluations to measure their performance. | Post-trainAgent | 9 |
| Head of Solutions Architecture Head of Solutions Architecture role focused on leading a global team to drive sales and revenue growth by designing and delivering enterprise AI solutions, with a strong emphasis on agentic AI and model customization. | Agent | 8 |
| Lead Member of Technical Staff, Inference Infrastructure Lead Member of Technical Staff, Inference Infrastructure at Cohere. Responsible for the design, deployment, and operation of the AI platform delivering large language models through API endpoints. Focuses on optimizing NLP models for low latency, high throughput, and high availability, with a strong emphasis on Kubernetes, GPU workloads, and multi-cloud environments. Requires extensive experience in production infrastructure, distributed systems, and technical leadership, including mentoring engineers and guiding strategic infrastructure decisions. | Serve | 8 |
| Data Engineer Cohere is seeking a Data Engineer to work on foundational infrastructure for AI systems, including storage, product launches, and customer experiences. The role involves collaborating with researchers and engineers, running implementations end-to-end, and partnering across departments to define growth strategies. The ideal candidate has 5+ years of experience in production-grade data processing systems, strong Python and SQL skills, and experience with distributed data processing frameworks. | Data | 8 |
| Solutions Architect - Public Sector Solutions Architect for Cohere's Public Sector business, focusing on technical pre-sales and post-sales. This role involves understanding customer problems, mapping them to Cohere solutions, building PoCs, and acting as a trusted technical advisor. Requires a Security Clearance and experience architecting/deploying NLP/AI/LLM solutions. | Agent | 8 |
| Staff Software Engineer, Inference Infrastructure Cohere is seeking a Staff Software Engineer to join their Model Serving team. This role focuses on developing, deploying, and operating the AI platform that delivers Cohere's large language models via API endpoints. The engineer will optimize NLP models for low latency, high throughput, and high availability, working with distributed systems, Kubernetes, and GPU workloads. Experience with cloud platforms and high-performance languages is required. | Serve | 8 |
| Audio Inference Engineer, Model Efficiency Cohere is seeking an Audio Inference Engineer to optimize audio inference serving efficiency, focusing on latency, throughput, and quality for real-time and streaming audio workloads. The role involves deep system analysis, bottleneck identification, and developing creative solutions for audio processing and inference. | ServePost-train | 8 |
| Software Engineer, Data Infrastructure Software Engineer to build and maintain the high-performance data layer for AI training and evaluation workloads, working on petabyte-scale storage infrastructure and distributed data processing. | Data | 7 |