Staff Research Engineer, Gemini Audio, Deepmind

Google Google · Big Tech · Mountain View, CA +2

Staff Research Engineer focused on Gemini Audio at DeepMind, developing multimodal models (audio/audio-visual) and driving end-to-end development from pretraining to production systems, utilizing RL frameworks for training infrastructure and model training.

What you'd actually do

  1. Utilize RL frameworks and infra to support training infrastructure and model training.
  2. Communicate and coordinate effectively across organizations.
  3. Design and iterate on training recipes.
  4. Develop multimodal models (e.g., audio or audio-visual)
  5. Drive end-to-end development from pretraining checkpoints to production systems.

Skills

Required

  • software development
  • testing
  • launching software products
  • building and deploying ML models
  • data preparation
  • training ML models
  • evaluation of ML models
  • software design and architecture

Nice to have

  • Master’s degree or PhD in Engineering, Computer Science, or a related technical field
  • data structures and algorithms
  • technical leadership role leading project teams and setting technical direction
  • structured organization involving cross-functional, or cross-business projects

What the JD emphasized

  • 5 years of industry experience building and deploying ML models
  • 5 years of experience in data preparation, training, and evaluation of ML models

Other signals

  • Develop multimodal models (e.g., audio or audio-visual)
  • Drive end-to-end development from pretraining checkpoints to production systems
  • Utilize RL frameworks and infra to support training infrastructure and model training