Senior Research Scientist

Adobe Adobe · Enterprise · San Francisco, CA

Senior Research Scientist at Adobe Speech AI Lab focusing on speech generative AI, audio modeling, and multimodal learning. The role involves leading independent research, designing and advancing state-of-the-art AI models and training procedures, including foundation models and generative systems. Emphasis on modern large-scale model architectures, publishing research, and building prototypes for product integration.

What you'd actually do

  1. Lead and execute independent research in speech generative AI, audio modeling, and multimodal learning (text and visuals), with a strong focus on modern large-scale model architectures.
  2. Design, analyze, and advance state-of-the-art AI models and training procedures, including foundation models, representation learning, and generative systems for speech and audio.
  3. Maintain strong technical currency with the latest developments in model architectures, optimization techniques, data scaling, and training strategies, and apply them creatively to audio and multimodal problems.
  4. Publish research at leading conferences and journals and pursue patent protection for impactful innovations.
  5. Build high-quality research prototypes that demonstrate technical rigor, scalability, and clear paths to product integration.

Skills

Required

  • Ph.D (preferred). or Master’s degree in Computer Science, Electrical Engineering, or a related field, with a strong research focus in audio, speech, and machine learning.
  • 3+ years of research experience in industry strongly preferred.
  • Proven track record of independent research contributions, such as first-author publications or equivalent leadership in research projects.
  • Deep expertise in audio and machine learning, including strong intuition for: Speech and audio generation, Audio representations and modeling, Training large-scale neural models
  • Hands-on experience with modern AI architectures and training pipelines, and a demonstrated ability to quickly adopt and extend new techniques.
  • Strong Python and deep learning development skills, with attention to performance, reproducibility, and experimental rigor.
  • Excellent communication skills and the ability to clearly articulate complex technical ideas.
  • A clear appetite for real-world impact, coupled with a commitment to technical excellence and high research standards.

Nice to have

  • multimodal learning (text and visuals)

What the JD emphasized

  • independent research leadership
  • technical excellence
  • publication at top venues
  • rapid iteration from research insight to deployed technology
  • Proven track record of independent research contributions, such as first-author publications or equivalent leadership in research projects.
  • Deep expertise in audio and machine learning
  • Hands-on experience with modern AI architectures and training pipelines
  • commitment to technical excellence and high research standards

Other signals

  • speech generative AI
  • multimodal learning
  • large-scale model architectures
  • foundation models
  • generative systems for speech and audio
  • research prototypes