AI Scientist - Audio

Mistral AI · AI Frontier · Paris, France · Research

Mistral AI is seeking an AI Scientist specializing in Audio to research and develop novel methods for large language models, focusing on speech input/output. The role involves building tooling for training and evaluation, working across modalities and use cases, and shipping AI systems with real-world impact. It requires expertise in speech methodologies, strong software engineering skills, and experience with AI frameworks or distributed systems.

What you'd actually do

  1. Research and develop novel methods to push the frontier of large language models
  2. Work across use cases (e.g. reasoning, code, agents) and modalities (e.g. text, image and speech)
  3. Build tooling and infrastructure to allow training, evaluation and analysis of AI models at scale
  4. Work cross-functionally with other scientists, engineers and product teams to ship AI systems that have real-world impact

Skills

Required

  • speech input/output methodologies
  • Python
  • PyTorch
  • JAX
  • Ray
  • Kubernetes
  • software engineering
  • distributed systems

Nice to have

  • large-scale speech-language models
  • training large transformer models in a distributed fashion
  • fine-tuning
  • evaluation
  • deployment
  • publication record

What the JD emphasized

  • expert in speech input/output methodologies (specific to audio)
  • highly proficient software engineer in at least one programming language (Python or other, e.g. Rust, Go, Java)
  • hands-on experience with AI frameworks (e.g. PyTorch, JAX) or distributed systems (e.g. Ray, Kubernetes)
  • high engineering competence, meaning the ability to design complex software and make it usable in production

Other signals

  • frontier research
  • large language models
  • speech
  • training
  • distributed systems