Research Engineer, Machine Learning - Paris/london/zurich/warsaw

Mistral AI Mistral AI · AI Frontier · Paris, France · Research

Research Engineer at Mistral AI focused on building and optimizing large-scale learning systems for open-weight models. The role involves working with Research Scientists to enhance training frameworks, data pipelines, and cluster tooling, or to integrate cutting-edge research into repeatable, scalable code within research squads (Alignment, Pre-training, Multimodal, Safety). Responsibilities include accelerating researchers, interfacing research with production, conducting large-scale experiments, designing/implementing ML algorithms, and delivering prototypes for production.

What you'd actually do

  1. Accelerate researchers by taking on the heavy parts of large-scale ML pipelines and building robust tools.
  2. Interface cutting-edge research with production: integrate checkpoints, streamline evaluation, and expose APIs.
  3. Conduct experiments on the latest deep-learning techniques (sparsified 70 B + runs, distributed training on thousands of GPUs).
  4. Design, implement and benchmark ML algorithms; write clear, efficient code in Python.
  5. Deliver prototypes that become production-grade components for _Le Chat_ and our enterprise API.

Skills

Required

  • Master’s or PhD in Computer Science (or equivalent proven track record)
  • 4 + years working on large-scale ML codebases
  • PyTorch, JAX or TensorFlow
  • distributed training (DeepSpeed / FSDP / SLURM / K8s)
  • deep learning
  • NLP
  • LLMs
  • Python
  • testing
  • code review
  • CI/CD

Nice to have

  • CUDA
  • data-pipeline chops

What the JD emphasized

  • 4 + years working on large-scale ML codebases
  • large-scale ML pipelines
  • distributed training on thousands of GPUs

Other signals

  • large-scale learning systems
  • open-weight models
  • distributed training on thousands of GPUs
  • deep learning
  • NLP
  • LLMs