AI Tutor - Swedish

xAI xAI · AI Frontier · Remote · Human Data

The AI Tutor role focuses on curating and annotating multilingual audio data to improve Grok's voice interactions, speech recognition, and auditory experiences. This involves providing labels, annotations, and recordings for various languages and accents, collaborating with technical staff to enhance AI's handling of speech nuances, and improving annotation tools. The role requires native Swedish proficiency, strong English skills, excellent auditory perception, and the ability to handle diverse audio content and transcribe accurately. Preferred skills include advanced linguistic understanding, experience with speech datasets, and professional voice work.

What you'd actually do

  1. Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.
  2. Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards.
  3. Collaborate with technical staff to develop tasks that improve AI's ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.
  4. Work with technical staff to improve annotation tools for efficient audio workflows.

Skills

Required

  • Native proficiency in Swedish
  • Proficiency in English (minimum B2 level)
  • Strong auditory perception
  • Ability to handle multilingual audio content
  • Ability to transcribe audio with high accuracy
  • Comfort providing high-quality voice recordings
  • Strong comprehension skills
  • Ability to make independent judgments on ambiguous or varied audio material
  • Strong communication skills
  • Interpersonal skills
  • Analytical skills
  • Detail-oriented
  • Organizational skills

Nice to have

  • Exceptional attention to linguistic nuance, auditory detail, and data quality
  • Deep understanding and taste of what good/useful Audio data is
  • Strong command of advanced transcription and annotation practices
  • Background in linguistics (e.g., phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or a related field
  • Experience working with speech/audio datasets, annotation workflows, or AI training data
  • Knowledge/experience with training voice models
  • Understanding of how data quality impacts model performance
  • Professional experience in voice work, including voice acting, voice recording, podcasting
  • Portfolio (strongly preferred for advanced candidates): Voice samples, annotated transcripts, or audio-related work

What the JD emphasized

  • Native proficiency in Swedish
  • Proficiency in English (minimum B2 level)
  • Strong auditory perception
  • Demonstrated ability to handle multilingual audio content
  • Demonstrated ability to transcribe audio with high accuracy
  • Commitment to developing AI that masters sophisticated multilingual audio capabilities.

Other signals

  • multilingual audio data curation
  • speech recognition enhancement
  • voice interaction refinement