Go Live AI Scientist - Warsaw

Mistral AI Mistral AI · AI Frontier · Warsaw, Poland · Research

Mistral AI is seeking an AI Scientist in Warsaw to research and develop novel methods for large language models, working across use cases and modalities. The role involves building tooling for training, evaluation, and analysis of AI models at scale, and collaborating with cross-functional teams to ship AI systems with real-world impact. Requires strong software engineering skills, experience with AI frameworks or distributed systems, and production deployment capabilities.

What you'd actually do

  1. Research and develop novel methods to push the frontier of large language models
  2. Work across use cases (e.g reasoning, code, agents) and modalities (e.g text, image and speech)
  3. Build tooling and infrastructure to allow training, evaluation and analysis of AI models at scale
  4. Work cross-functionally with other scientists, engineers and product teams to ship AI systems which have a real-world impact

Skills

Required

  • Python
  • PyTorch
  • JAX
  • Ray
  • Kubernetes

Nice to have

  • training large transformer models in a distributed fashion
  • fine-tuning
  • evaluation
  • deployment
  • strong publication record

What the JD emphasized

  • high engineering competence
  • production

Other signals

  • frontier models
  • developer tools
  • compute
  • enterprise AI systems