About Mistral
Mistral provides full-stack AI solutions: from frontier models to developer tools, applications, and compute. We partner with enterprises tackling the hardest problems—across high-stakes industries like finance, manufacturing, defense, healthcare, and the public sector—co-creating customized AI systems that they can run on their terms.
We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between Europe, North America, Asia and the Middle East. We are creative, low-ego and team-spirited.
The Role
As an AI Scientist,** ** you will research and develop novel methods to advance the capabilities of large language models. This role exists to push the boundaries of what AI can achieve, enabling Mistral to deliver systems that transform how businesses operate and integrate into daily life.
You will join a team of engineers and scientists focused on building tooling and infrastructure to train, evaluate, and analyse AI models at scale. Your work will span multiple use cases and modalities, from reasoning and code generation to text, image, and speech processing.
The impact of this role is far-reaching, as you will collaborate cross-functionally to ship AI systems that have real-world applications and open-source frontier models for global benefit.
What You Will Do
- Research and develop innovative methods to advance the frontier of large language models.
- Build tooling and infrastructure to enable large-scale training, evaluation, and analysis of AI models.
- Collaborate with scientists, engineers, and product teams to deploy AI systems with tangible real-world impact.
- Design and implement solutions that address complex challenges in AI model development and deployment.
- Contribute to the open-source community by releasing cutting-edge models and tools.
What We're Looking For
- A strong publication record in a relevant scientific domain, such as machine learning or natural language processing.
- High proficiency as a software engineer in at least one programming language, such as Python, Rust, Go, or Java.
- Hands-on experience with AI frameworks like PyTorch or JAX, or distributed systems such as Ray or Kubernetes.
- A self-starter mindset, with the autonomy to drive projects forward and the ability to work effectively in a team.
- Experience with training large transformer models in a distributed manner.
- Familiarity with the full MLOps stack, including fine-tuning, evaluation, and deployment.
- Experience with audio or speech processing, including input/output and NLP.
What we offer
We offer a comprehensive benefits package designed to support your well-being, growth, and work-life balance. Benefits vary by country and may include healthcare coverage, parental leave, retirement plans, relocation support, wellness programs, meal and transportation allowances, and other location-specific perks.
For the most up-to-date details on benefits available in your location, please refer to our Benefits page.
Privacy Policy
Your privacy matters to us. You can learn more about how we handle your personal data in our Applicant Privacy Policy.