About Mistral
Mistral provides full-stack AI solutions: from frontier models to developer tools, applications, and compute. We partner with enterprises tackling the hardest problems—across high-stakes industries like finance, manufacturing, defense, healthcare, and the public sector—co-creating customized AI systems that they can run on their terms.
We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed across Europe, North America, Asia, and the Middle East. We are creative, low-ego, and team-spirited.
The Role
As an AI Scientist, you will push the boundaries of AI by developing and scaling cutting-edge models. This role exists to advance Mistral’s mission of democratizing frontier intelligence, enabling businesses and consumers to benefit from open, high-impact AI systems. You will collaborate with a global, cross-functional team to design, build, and deploy models that solve real-world problems across reasoning, code, and multimodal applications. Your work will directly shape how AI integrates into daily life and enterprise operations.
What You Will Do
- Research and develop novel methods to advance the capabilities of LLMs.
- Design and implement tooling and infrastructure for training, evaluating, and analysing AI models at scale.
- Collaborate with scientists, engineers, and product teams to ship AI systems with tangible impact.
- Explore and innovate across use cases such as reasoning, code generation, and agent-based systems.
- Work with multimodal data, including text, image, and speech, to expand model capabilities.
- Optimise distributed systems to improve the efficiency and scalability of model training.
- Contribute to the open-source community by releasing frontier models for public use.
What We're Looking For
- A track record of contributions to relevant scientific domains, demonstrated through publications or open-source projects.
- Hands-on experience with AI frameworks like PyTorch or JAX, or distributed systems such as Ray or Kubernetes.
- Proficiency in at least one programming language, such as Python, Rust, Go, or Java.
- Ability to design complex software systems and transition them into production environments.
- Experience with training large transformer models in a distributed setting.
- Familiarity with the full MLOps stack, including fine-tuning, evaluation, and deployment.
- A self-starter mindset, with the autonomy to drive projects forward and the ability to collaborate effectively in a team.
- Strong engineering competence, with a focus on building scalable and reliable systems.
What We Offer
We offer a comprehensive benefits package designed to support your well-being, growth, and work-life balance. Benefits vary by country and may include healthcare coverage, parental leave, retirement plans, relocation support, wellness programs, meal and transportation allowances, and other location-specific perks.
For the most up-to-date details on benefits available in your location, please refer to our Benefits page.
Privacy Policy
Your privacy matters to us. You can learn more about how we handle your personal data in our Applicant Privacy Policy.