Research Engineer, Machine Learning, Paris/London/Zurich/Warsaw

Mistral AI · AI Frontier · Paris, France · Research

Research Engineer focused on building and optimizing large-scale learning systems for open-weight models. This role involves accelerating researchers by managing ML pipelines, integrating research with production, conducting experiments on deep learning techniques, and delivering prototypes for production use. The role can be in Platform (shared infra, data pipelines, tooling) or Embedded (within research squads like Alignment, Pre-training, Multimodal, Safety).

What you'd actually do

  1. Accelerate researchers by taking on the heavy parts of large-scale ML pipelines and building robust tools.
  2. Interface cutting-edge research with production: integrate checkpoints, streamline evaluation, and expose APIs.
  3. Conduct experiments on the latest deep-learning techniques (sparsified 70B+ runs, distributed training on thousands of GPUs).
  4. Design, implement and benchmark ML algorithms; write clear, efficient code in Python.
  5. Deliver prototypes that become production-grade components for _Le Chat_ and our enterprise API.

Skills

Required

  • Python
  • PyTorch, JAX or TensorFlow
  • distributed training (DeepSpeed / FSDP / SLURM / K8s)
  • deep learning
  • NLP or LLMs
  • software-design instincts
  • testing
  • code review
  • CI/CD

Nice to have

  • CUDA
  • data-pipeline chops

What the JD emphasized

  • 4+ years working on large-scale ML codebases
  • large-scale ML codebases
  • distributed training

Other signals

  • large-scale ML pipelines
  • cutting-edge research
  • production-grade components