AI Tutor - Tamil

xAI · AI Frontier · Remote · Human Data

The AI Tutor - Tamil role at xAI focuses on training and refining AI models, specifically Grok, for multilingual audio capabilities. This involves curating and annotating high-quality audio data in Tamil and English to improve voice interactions, speech recognition, and auditory experiences globally. Responsibilities include labeling, annotating, and recording audio clips, collaborating with technical staff to enhance AI's handling of speech nuances, and improving annotation tools.

What you'd actually do

  1. Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.
  2. Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards.
  3. Collaborate with technical staff to develop tasks that improve AI's ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.
  4. Work with technical staff to improve annotation tools for efficient audio workflows.

Skills

Required

  • Native proficiency in Tamil
  • Proficiency in English (minimum B2 level)
  • Strong auditory perception
  • Ability to handle multilingual audio content
  • Ability to transcribe audio with high accuracy
  • Ability to provide high-quality voice recordings
  • Strong comprehension skills
  • Independent judgment on ambiguous audio material
  • Strong communication skills
  • Interpersonal skills
  • Analytical skills
  • Detail-oriented
  • Organizational skills

Nice to have

  • Exceptional attention to linguistic nuance
  • Deep understanding of what makes audio data good and useful
  • Strong command of advanced transcription and annotation practices
  • Background in linguistics, speech sciences, cognitive science, or related field
  • Experience working with speech/audio datasets
  • Knowledge of, or experience with, training voice models
  • Professional experience in voice work (voice acting, recording, podcasting)
  • Portfolio of voice samples or annotated transcripts

What the JD emphasized

  • Native proficiency in Tamil with exposure to diverse accents, dialects, or regional variations.
  • Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes.
  • Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages.
  • Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form.
  • Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality.
  • Commitment to developing AI that masters sophisticated multilingual audio capabilities.

Other signals

  • multilingual audio data curation
  • speech recognition training
  • voice interaction refinement