What you'd actually do

Design, develop, and deploy scalable and agentic AI solutions for high-value, real-world multimodal conversational AI use cases on smart glasses.

Gain an understanding of the Gemini Live and Astra tech stack and infrastructure. Optimize agent architecture/orchestration to ensure efficient deployment and operation at scale, with a focus on inference cost optimization.

Take ownership of AI quality for production systems. This includes defining technical metrics, implementing evaluation frameworks, analyzing loss patterns, and driving improvements through data collection, smart data generation, and model enhancements.

Implement, optimize, and advance AI techniques, with a focus on multimodal conversational quality, multimodal tool use, and multimodal goal-oriented reasoning.

Skills

Required

software development in Python or C++
ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging)
GenAI techniques (e.g., LLMs, Multi-Modal, Large Vision Models) or GenAI-related concepts (language modeling, computer vision)

Nice to have

data structures and algorithms
applied research to enable new functionality and improve the quality and efficiency of large language and multimodal models
machine learning and statistics

Other signals

building agentic AI solutions

multimodal experience

Gemini Live and Astra

smart glasses

goal-oriented reasoning tasks

multimodal tools and extensions

data, evaluation, and post-tuning of the Gemini model

AI and XR convergence

augment human intelligence

personalized, conversational, and contextually aware experiences

scalable and agentic AI solutions

real-world multimodal conversational AI use cases on smart glasses

agent architecture/orchestration

inference cost optimization

AI quality for production systems

evaluation frameworks

model enhancements

multimodal conversational quality

multimodal tool use

multimodal goal-oriented reasoning

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.

Our team is at the forefront of building the next generation of conversational AI. We're developing agentic AI solutions for smart glasses, utilizing Gemini Live and Astra to create a unique and trusted multimodal experience. This technology delivers instant, natural conversational intelligence directly to the user's eye, allowing them to navigate their world more immersively than ever.

In this role, you will design multimodal agentic solutions focused on goal-oriented reasoning tasks. You will enhance and develop new multimodal tools and extensions. You will define and execute the strategy for data, evaluation, and post-tuning of the Gemini model to enhance its impact for smart glasses use cases.

For decades, the computing revolution has reshaped our world, driven by breakthroughs in compute, connectivity, mobile, and now, AI. Google's XR team is at the forefront of the next major leap – the convergence of AI and XR. This is more than just new devices – it's about reimagining how we interact with the world around us. We're building a future where lightweight XR devices like smart glasses and headsets pair with helpful AI to augment human intelligence, offering personalized, conversational, and contextually aware experiences.

Individual pay is determined by factors including job-related skills, experience, and relevant education or training.

US: $147,000 - $211,000 (USD) + 15% bonus target + bonus + equity + benefits

Learn more about benefits at Google.

Responsibilities

Design, develop, and deploy scalable and agentic AI solutions for high-value, real-world multimodal conversational AI use cases on smart glasses.
Gain an understanding of the Gemini Live and Astra tech stack and infrastructure. Optimize agent architecture/orchestration to ensure efficient deployment and operation at scale, with a focus on inference cost optimization.
Take ownership of AI quality for production systems. This includes defining technical metrics, implementing evaluation frameworks, analyzing loss patterns, and driving improvements through data collection, smart data generation, and model enhancements.
Implement, optimize, and advance AI techniques, with a focus on multimodal conversational quality, multimodal tool use, and multimodal goal-oriented reasoning.

Qualifications

Minimum qualifications:

Bachelor’s degree or equivalent practical experience.
2 years of experience with software development in Python or C++.
1 year of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging).
Experience with GenAI techniques (e.g., LLMs, Multi-Modal, Large Vision Models) or with GenAI-related concepts (language modeling, computer vision).

Preferred qualifications:

Master's degree or PhD in Computer Science, or a related technical field.
2 years of experience with data structures and algorithms.
Experience conducting applied research to enable new functionality and improve the quality and efficiency of large language and multimodal models.
Knowledge of machine learning and statistics.

Software Engineer III, Multimodal Agentic AI, XR
