Research Internship (fall / Winter 2026)

Cohere Cohere · AI Frontier · Canada · Internships

Cohere is seeking Research Interns to collaborate on designing and implementing novel research ideas and shipping state-of-the-art models to production. Interns will conduct cutting-edge machine learning research, build and train large language models, and focus on expanding the frontier of knowledge in language modeling and related areas. Research results will be disseminated through publications, datasets, and code, contributing to practical applications in Cohere's product development.

What you'd actually do

  1. Conduct cutting-edge machine learning research, building and training large language models.
  2. Focus on research projects aimed at expanding the frontier of knowledge in language modelling and associate areas such as evaluation, multimodal models, optimisation etc.
  3. Disseminate your research results through the production of publications, datasets, and code.
  4. Contribute to research initiatives that have practical applications in Cohere’s product development.

Skills

Required

  • Python
  • C
  • C++
  • Lua
  • JAX
  • Pytorch
  • Tensorflow
  • experience using large-scale distributed training strategies
  • data annotation and evaluation pipelines
  • implementing state of the art ML models
  • autoregressive sequence models, such as Transformers
  • strong communication and problem-solving skills

Nice to have

  • Demonstrated expertise through publications in top tier venues in fields such as machine learning, NLP, artificial intelligence, computer vision, optimization, computer science, statistics, applied mathematics, or data science.
  • Proven ability to tackle analytical problems using quantitative methodologies.
  • Proficiency in handling and analysing complex, high-dimensional data from various sources.
  • Experience in applying theoretical and empirical research to real-world problem-solving.

What the JD emphasized

  • PhD in Machine Learning, NLP, or a related discipline
  • full-time internship that lasts for 4-6 months
  • publications in top tier venues

Other signals

  • training large language models
  • expanding the frontier of knowledge in language modelling
  • publications, datasets, and code