Member of Technical Staff, Integration/RL Team (Research Engineer)

Cohere · AI Frontier · Paris, France · Modeling

Cohere is hiring a Member of Technical Staff for its Integration/RL Team. The role focuses on developing and scaling machine learning algorithms and infrastructure for LLM post-training, particularly large-scale distributed RL methods. It involves enhancing the post-training codebase, implementing new research tools, optimizing algorithms, and scaling distributed RL.

What you'd actually do

  1. Design and write high-performing and scalable software for training models.
  2. Develop new tools to support and accelerate research and LLM training.
  3. Coordinate with other engineering teams (Infrastructure, Efficiency, Serving) and the scientific teams (Agent, Multimodal, Multilingual, etc.) to create a strong and integrated post-training ecosystem.
  4. Craft and implement techniques to improve performance and speed up our training cycles across the SFT, offline preference, and RL regimes.
  5. Research, implement, and experiment with ideas on our cluster and data infrastructure.

Skills

Required

  • Python
  • JAX
  • PyTorch
  • XLA/MLIR
  • large-scale distributed training strategies
  • software engineering

Nice to have

  • Kubernetes
  • Ray
  • post-training phase of model training
  • ML, LLM, and RL academic research

What the JD emphasized

  • Extremely strong software engineering skills
  • large-scale distributed training strategies
  • post-training phase of model training

Other signals

  • LLM post-training
  • distributed RL
  • scaling algorithms
  • production code