Staff Software Engineer, Applied Research, Foundation User Models

Google Google · Big Tech · Mountain View, CA +1

Staff Software Engineer, Applied Research, Foundation User Models at Google. This role focuses on defining and executing the applied research roadmap for Large User Models, translating business goals into technical formulations, optimizing model performance with adaptation techniques, driving architectural improvements by influencing pre-training teams, and productionizing fine-tuning pipelines for recommendation engines. The role requires experience with Transformer-based models and ML design, with a focus on balancing quality output with strict inference latency requirements.

What you'd actually do

  1. Define and execute the applied research roadmap for the Large User Model ecosystem, balancing immediate customer needs with long-term technical evolution to scale foundational models across high-traffic surfaces.
  2. Support initiatives with product leadership to translate complex business goals into technical model formulations, delivering step-function improvements in user engagement and business metrics while optimizing for latency and performance trade-offs.
  3. Optimize model performance by researching and implementing adaptation techniques (transfer learning/domain adaptation) that balances high-quality output with strict inference latency requirements for production environments.
  4. Drive architectural improvements by establishing a strategic feedback loop with pre-training teams, utilizing downstream performance analysis to influence data curation, model architecture, and novel evaluation metrics for engagement-specific needs.
  5. Design and productionize fine-tuning pipelines that translate general-purpose state-of-the-art (SoTA) Foundation User Models into effective, domain-specific recommendation engines.

Skills

Required

  • Transformer-based models (e.g., BERT, T5, GPT, ViT), including attention mechanisms and architecture variations
  • ML design (e.g., model deployment, model evaluation, data processing, debugging, fine-tuning)
  • software development
  • software design and architecture
  • testing, and launching software products

Nice to have

  • publishing in venues or contributing to open-source projects related to RecSys, transfer learning, NLP/CV, or multimodal systems
  • technical leadership role leading project teams and setting technical direction
  • working in a complex, matrixed organization involving cross-functional, or cross-business projects
  • modern recommendation architectures (e.g., two-tower models, sequential user modeling) and how to integrate Large Foundation Models into existing ranking/retrieval stacks
  • Master’s degree or PhD in Engineering, Computer Science, or a related technical field
  • data structures/algorithms

What the JD emphasized

  • production environments
  • inference latency requirements
  • strict inference latency requirements

Other signals

  • Applied research
  • Foundation User Models
  • recommendation engines
  • production environments
  • inference latency