Research Scientist, Sound Understanding, Deepmind

Google Google · Big Tech · Mountain View, CA +3

Research Scientist on the Sound team within Google DeepMind Frontier AI, focused on advancing research in sound understanding, joint audio-video generation, and audio editing, contributing to the next generation of generative AI technology. The role involves improving model quality, unlocking new audio capabilities, developing evaluation methods, and publishing research.

What you'd actually do

  1. Improve quality of models for audio understanding and generation, including research on architectures, representations, training losses and paradigms, and test-time techniques for improved generation quality and efficiency.
  2. Unlock new audio capabilities in foundational models, both in pre-training and post-training data pipelines.
  3. Develop better evaluation methods (human evaluation, auto raters, automated metrics) to measure quality of open-ended audio tasks.
  4. Publish research at venues and contribute to Google DeepMind products.
  5. Collaborate across teams to advance research in sound understanding, joint audio-video generation, and audio editing.

Skills

Required

  • PhD in Computer Science or related field
  • Experience with text, image, video, or audio generation
  • Experience in AI/ML
  • Experience with Generative AI

Nice to have

  • PhD in AI/ML or related technical field
  • Publication record
  • Experience developing/launching products with LLMs

What the JD emphasized

  • PhD degree in Computer Science, a related field, or equivalent practical experience.
  • Experience with text, image, video, or audio generation.
  • Experience in Artificial Intelligence or Machine Learning.
  • Experience with Generative AI.

Other signals

  • DeepMind
  • Frontier AI
  • generative AI technology
  • audio understanding
  • audio generation
  • joint audio-video generation
  • audio editing