AI Scientist - Warsaw

Mistral AI · AI Frontier · Warsaw, Poland · Research

AI Scientist role at Mistral AI focused on researching and developing novel methods for large language models across use cases and modalities. The role involves building tooling and infrastructure for training, evaluation, and analysis at scale, and collaborating with scientists, engineers, and product teams to ship AI systems with real-world impact. It requires strong software engineering skills, experience with AI frameworks or distributed systems, and the engineering competence to make complex software production-ready. Ideal candidates have trained large transformer models in a distributed fashion, are comfortable with the full MLOps stack, and hold a strong publication record.

What you'd actually do

  1. Research and develop novel methods to push the frontier of large language models
  2. Work across use cases (e.g. reasoning, code, agents) and modalities (e.g. text, image and speech)
  3. Build tooling and infrastructure to allow training, evaluation and analysis of AI models at scale
  4. Work cross-functionally with other scientists, engineers and product teams to ship AI systems which have a real-world impact

Skills

Required

  • Python or another programming language (e.g. Rust, Go, Java)
  • PyTorch, JAX, or distributed systems (e.g. Ray, Kubernetes)
  • ability to design complex software and make it usable in production
  • self-starter who is autonomous and a team player

Nice to have

  • training large transformer models in a distributed fashion
  • fine-tuning, evaluation and deployment
  • strong publication record in a relevant scientific domain

What the JD emphasized

  • push the frontier of large language models
  • training, evaluation and analysis of AI models at scale
  • ship AI systems which have a real-world impact
  • high engineering competence
  • training large transformer models in a distributed fashion
  • full MLOps stack
  • strong publication record

Other signals

  • research and develop novel methods
  • push the frontier of large language models
  • work across use cases and modalities
  • build tooling and infrastructure for training, evaluation, and analysis
  • ship AI systems with real-world impact