AI Tutor - French

xAI xAI · AI Frontier · Remote · Human Data

The AI Tutor - French role at xAI focuses on training and refining AI models, specifically Grok, for multilingual audio capabilities. This involves curating and annotating audio data in French and English, focusing on speech recognition, voice interactions, and auditory experiences across diverse languages and accents. The goal is to enhance Grok's global accessibility and improve its handling of multilingual audio nuances.

What you'd actually do

  1. Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.
  2. Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards.
  3. Collaborate with technical staff to develop tasks that improve AI's ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.
  4. Work with technical staff to improve annotation tools for efficient audio workflows.

Skills

Required

  • Native proficiency in French
  • Proficiency in English (minimum B2 level)
  • Strong auditory perception
  • Ability to handle multilingual audio content
  • Ability to transcribe audio with high accuracy
  • Comfort providing high-quality voice recordings and feedback
  • Strong comprehension skills
  • Strong communication, interpersonal, analytical, detail-oriented, and organizational skills

Nice to have

  • Exceptional attention to linguistic nuance, auditory detail, and data quality
  • Deep understanding and taste of what good/useful Audio data is
  • Strong command of advanced transcription and annotation practices
  • Background in linguistics, speech sciences, cognitive science, or related field
  • Experience working with speech/audio datasets, annotation workflows, or AI training data
  • Professional experience in voice work (voice acting, recording, podcasting)
  • Ability to exercise independent judgment in ambiguous audio scenarios
  • Portfolio of voice samples, annotated transcripts, or audio-related work

What the JD emphasized

  • native proficiency in French
  • proficiency in English
  • strong auditory perception
  • demonstrated ability to handle multilingual audio content
  • demonstrated ability to transcribe audio with high accuracy
  • comfort providing high-quality voice recordings and feedback
  • strong comprehension skills
  • strong communication, interpersonal, analytical, detail-oriented, and organizational skills
  • commitment to developing AI that masters sophisticated multilingual audio capabilities
  • exceptional attention to linguistic nuance, auditory detail, and data quality
  • deep understanding and taste of what good/useful Audio data is
  • strong command of advanced transcription and annotation practices
  • background in linguistics (e.g., phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or a related field, or equivalent practical experience
  • experience working with speech/audio datasets, annotation workflows, or AI training data
  • professional experience in voice work, including voice acting, voice recording, podcasting with a measurable audience (e.g., X following), or similar audio production
  • demonstrated ability to exercise independent judgment in ambiguous audio scenarios
  • portfolio (strongly preferred for advanced candidates): Voice samples, annotated transcripts, or audio-related work demonstrating quality, methodology, and attention to detail
  • candidates with professional experience in voice, linguistics, speech data, or speech evaluation and research are especially encouraged to apply

Other signals

  • training and refining Grok to excel in voice interactions
  • curating and annotating high-quality audio data
  • enhance Grok's global accessibility
  • bridging language barriers through accurate speech processing