Research Software Engineer, Multimodal AI

Google Google · Big Tech · San Jose, CA +1

This role focuses on developing and enhancing AI agents, particularly in the multimodal domain (audio, vision), for XR devices. The engineer will work on LLMs, agents, and their integration into next-generation computing products, involving algorithm development, production-quality coding, evaluation planning, and shipping AI innovations. The role requires experience in software development, generative AI, ML frameworks, and multimodal learning.

What you'd actually do

  1. Develop algorithms/models to enhance AI agents for XR devices using techniques like prompting, few-shot learning, post-training techniques to improve model performance and real-world XR scenarios.
  2. Write production-quality C++/Python code and tests.
  3. Create a comprehensive evaluation plan, from dataset development to Key Performance Indicator (KPI) definitions and measurements.
  4. Identify, implement and ship the latest modeling innovations including, orchestration, multimodality, tool integrations, memory, hybrid agent architectures and personalization.
  5. Demonstrate concepts through rapid prototyping and iterative development, using team testing in close partnership with the XR product teams.

Skills

Required

  • software development
  • C++
  • Python
  • Generative AI
  • Machine Learning
  • deep learning
  • perception
  • computer vision

Nice to have

  • JAX
  • TensorFlow
  • PyTorch
  • multimodal learning
  • large language models
  • AI agents
  • prompt engineering
  • few-shot learning
  • post-training techniques
  • evaluations
  • large-scale model training
  • deployment
  • technical leadership

What the JD emphasized

  • Develop algorithms/models to enhance AI agents for XR devices
  • Identify, implement and ship the latest modeling innovations including, orchestration, multimodality, tool integrations, memory, hybrid agent architectures and personalization
  • Experience with multimodal learning, large language models or AI agents.

Other signals

  • Develop algorithms/models to enhance AI agents for XR devices
  • Identify, implement and ship the latest modeling innovations including, orchestration, multimodality, tool integrations, memory, hybrid agent architectures and personalization
  • Experience with multimodal learning, large language models or AI agents.