Research Internship (spring/summer 2026)

Cohere Cohere · AI Frontier · Canada · Internships

Cohere is seeking Research Interns to collaborate on designing and implementing novel research ideas and shipping state-of-the-art models to production. Interns will conduct cutting-edge ML research, build and train LLMs, focus on expanding the frontier of knowledge in language modeling and related areas, and disseminate research results through publications, datasets, and code. The role involves contributing to research initiatives with practical applications in product development.

What you'd actually do

  1. Conduct cutting-edge machine learning research, building and training large language models.
  2. Focus on research projects aimed at expanding the frontier of knowledge in language modelling and associate areas such as evaluation, multimodal models, optimisation etc.
  3. Disseminate your research results through the production of publications, datasets, and code.
  4. Contribute to research initiatives that have practical applications in Cohere’s product development.

Skills

Required

  • PhD in Machine Learning, NLP, or a related discipline (or exceptional non-PhD candidates)
  • Available for a full-time internship (4-6 months)
  • Experience using large-scale distributed training strategies
  • Experience with data annotation and evaluation pipelines
  • Experience implementing state-of-the-art ML models
  • Familiarity with autoregressive sequence models (e.g., Transformers)
  • Strong communication and problem-solving skills
  • Knowledge of programming languages such as Python, C, C++, Lua
  • Knowledge of ML frameworks such as JAX, PyTorch, and TensorFlow
  • Previous experience in building systems based on ML and deep learning techniques
  • Passion for applied NLP models and products

Nice to have

  • Publications in top-tier venues (ML, NLP, AI, Computer Vision, Optimization, CS, Statistics, Applied Math, Data Science)
  • Ability to tackle analytical problems using quantitative methodologies
  • Proficiency in handling and analyzing complex, high-dimensional data
  • Experience applying theoretical and empirical research to real-world problem-solving

What the JD emphasized

  • must be currently pursuing a PhD in Machine Learning, NLP, or a related discipline
  • available for a full-time internship that lasts for 4-6 months
  • building and training large language models
  • expanding the frontier of knowledge in language modelling
  • production of publications, datasets, and code
  • practical applications in Cohere’s product development
  • pursuing, or in the process of obtaining, a PhD in Machine Learning, NLP, Artificial Intelligence, or a related discipline
  • exceptional non-PhD candidates
  • large-scale distributed training strategies
  • data annotation and evaluation pipelines
  • implementing state of the art ML models
  • autoregressive sequence models, such as Transformers
  • strong communication and problem-solving skills
  • convey complex research findings clearly and succinctly
  • programming languages such as Python, C, C++, Lua, or related languages
  • ML frameworks such as JAX, Pytorch and Tensorflow
  • building systems based on machine learning and deep learning techniques
  • applied NLP models and products
  • Demonstrated expertise through publications in top tier venues
  • tackle analytical problems using quantitative methodologies
  • handling and analysing complex, high-dimensional data
  • applying theoretical and empirical research to real-world problem-solving

Other signals

  • training large language models
  • expanding the frontier of knowledge in language modelling
  • publications, datasets, and code