Member of Technical Staff, Agents Modeling

Cohere · AI Frontier · New York, NY · Modeling

Cohere is seeking an experienced ML researcher/engineer to push the frontiers of agentic LLM systems. This role involves exploring and developing agentic techniques, building models for agentic solutions, and devising strategies for training models with advanced agent capabilities such as reasoning, tool use, and memory. The role also includes developing data-generation techniques for post-training (SFT and RL*), with direct impact on Cohere's products.

What you'd actually do

  1. Design and develop novel agentic solutions
  2. Improve upon SOTA on hard agentic tasks
  3. Research the next generation of online self-improvement through learning from experience
  4. Work with partner teams (Reasoning, Post-training, Pre-training, etc.) to improve the performance of agentic systems
  5. Work with an amazing team of researchers and engineers pushing the boundaries

Skills

Required

  • Strong software engineering skills
  • Proficiency in Python and experience with ML-related code (e.g., PyTorch, NumPy)

Nice to have

  • PhD in computer science or a related field, or similar industry research experience
  • Experience with LLMs and agentic frameworks
  • Experience with post-training LLMs (SFT, PEFT, or RL*)
  • Experience with building synthetic data generation pipelines

What the JD emphasized

  • PhD in computer science or a related field, or similar industry research experience
  • Experience with LLMs and agentic frameworks
  • Experience with post-training LLMs (SFT, PEFT, or RL*)
  • Experience with building synthetic data generation pipelines

Other signals

  • agentic LLM systems
  • training models for advanced agent capabilities
  • reasoning, tool use, and memory
  • developing data-generation techniques for post-training