AI Tutor - Urdu

xAI · AI Frontier · Remote · Human Data

The role involves training and refining AI models for multilingual audio, with a focus on voice interactions, speech recognition, and auditory experiences. Responsibilities include curating and annotating audio data, providing labels and inputs on multilingual audio clips, and collaborating with technical staff to improve the models' handling of speech modulation, accent variation, and noise in real-world recordings. The goal is to enhance global accessibility and enable natural spoken interactions.

What you'd actually do

  1. Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.
  2. Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards.
  3. Collaborate with technical staff to develop tasks that improve AI's ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.
  4. Work with technical staff to improve annotation tools for efficient audio workflows.

Skills

Required

  • Urdu native proficiency
  • English proficiency (B2+)
  • Auditory perception
  • Multilingual audio handling
  • Accurate audio transcription
  • Voice recording and feedback
  • Independent judgment on audio material
  • Communication skills
  • Interpersonal skills
  • Analytical skills
  • Detail-oriented
  • Organizational skills

Nice to have

  • Exceptional attention to linguistic nuance
  • Deep understanding of what makes audio data good and useful
  • Advanced transcription and annotation practices
  • Background in linguistics, speech sciences, or cognitive science
  • Experience with speech/audio datasets and annotation workflows
  • Experience training voice models
  • Professional experience in voice work (acting, recording, podcasting)
  • Portfolio of voice samples or annotated transcripts

What the JD emphasized

  • Native proficiency in Urdu
  • Proficiency in English (minimum B2 level)
  • Strong auditory perception
  • Demonstrated ability to handle multilingual audio content
  • Demonstrated ability to transcribe audio with high accuracy
  • Comfort providing high-quality voice recordings and feedback
  • Strong comprehension skills
  • Commitment to developing AI that masters sophisticated multilingual audio capabilities

Other signals

  • training and refining Grok to excel in voice interactions
  • curating and annotating high-quality audio data
  • enhancing Grok's global accessibility
  • bridging language barriers through accurate speech processing