Applied Scientist, Prime Video - Content Localization, Understanding & Enrichment

Amazon · Big Tech · Seattle, WA · Applied Science

This role focuses on research and development of speech and audio generation technology, including end-to-end speech-to-speech architecture and audio processing solutions. The scientist will define the research roadmap, publish findings, and develop and implement deep learning algorithms; the listed qualifications ask for deep learning experience particularly with respect to computer vision. The role also involves building models for business applications and may include hiring and mentoring other scientists.

What you'd actually do

  1. Lead research and development of speech and audio generation technology and end-to-end speech-to-speech architecture
  2. Develop audio processing solutions for production environments, including source separation, enhancement, and mixing
  3. Define the research roadmap for your area, identify high-impact problems, and communicate technical direction to senior leadership
  4. Publish research, contribute to the broader scientific community, and bring external advances into production systems
  5. Hire, mentor, and develop applied scientists. Grow the team's capabilities to meet evolving customer and business needs

Skills

Required

  • building models for business applications
  • deep learning algorithms
  • computer vision algorithms
  • Java
  • C++
  • Python

Nice to have

  • Unix/Linux
  • professional software development
  • publications at top-tier peer-reviewed conferences or journals

What the JD emphasized

  • Lead research and development
  • Develop audio processing solutions
  • Define the research roadmap
  • Publish research
  • Hire, mentor, and develop applied scientists
  • 3+ years of experience building models for business applications
  • PhD, or Master's degree and 4+ years of CS, CE, ML or related field experience
  • Experience developing and implementing deep learning algorithms, particularly with respect to computer vision algorithms
  • Have publications at top-tier peer-reviewed conferences or journals

Other signals

  • speech and audio generation technology
  • end-to-end speech-to-speech architecture
  • audio processing solutions
  • Define the research roadmap
  • Publish research
  • Developing and implementing deep learning algorithms