AI Scientist - Palo Alto

Mistral AI Mistral AI · AI Frontier · Palo Alto, CA · Research

Mistral AI is seeking an AI Scientist to research and develop novel methods for large language models, working across use cases and modalities. The role involves building tooling for training, evaluation, and analysis of AI models at scale, and collaborating to ship AI systems with real-world impact. Requires strong software engineering skills, experience with AI frameworks or distributed systems, and high engineering competence for production deployment.

What you'd actually do

  1. Research and develop novel methods to push the frontier of large language models
  2. Work across use cases (e.g reasoning, code, agents) and modalities (e.g text, image and speech)
  3. Build tooling and infrastructure to allow training, evaluation and analysis of AI models at scale
  4. Work cross-functionally with other scientists, engineers and product teams to ship AI systems which have a real-world impact

Skills

Required

  • Python
  • PyTorch
  • JAX
  • Ray
  • Kubernetes
  • design complex software
  • production deployment

Nice to have

  • training large transformer models in a distributed fashion
  • fine-tuning
  • evaluation
  • deployment
  • Audio/Speech experience

What the JD emphasized

  • publication record

Other signals

  • frontier research
  • large language models
  • distributed systems
  • AI systems
  • novel methods