Staff Software Engineer, Aai Research

Google Google · Big Tech · Zürich, Switzerland

Research role focused on building and improving next-generation AI agents for Google Cloud customers, involving dataset creation, agent design, metric definition, and quality iteration.

What you'd actually do

  1. Execute quality iterations on reference agents identifying potential model or agent framework improvements.
  2. Source or facilitate creation of datasets for evaluation and training to improve model performance for Cloud customers.
  3. Design new agentic systems and context engineering algorithms, utilizing existing models and identifying the need for new training/evaluation objectives where necessary.
  4. Define and implement metrics that correspond to business problems.
  5. Prototype and iterate on the solution working closely with customers, product management and business development.

Skills

Required

  • Python
  • Large Language Models (LLMs)
  • building agents

Nice to have

  • tokens
  • context
  • Retrieval-Augmented Generation (RAG)
  • function calling
  • data
  • basic statistical analysis concepts
  • general data science principles
  • core ML concepts
  • training algorithms
  • best practices for evaluation
  • running quality interactions
  • understanding business goals
  • defining technical metrics
  • implementing evaluation frameworks
  • designing experiments
  • analyzing results
  • performing RCAs
  • formulating hypotheses
  • conducting ablation or live experiments

What the JD emphasized

  • 8 years of experience working with Large Language Models (LLMs) and building agents
  • running quality interactions, involving understanding business goals, defining technical metrics, implementing evaluation frameworks, designing experiments, analyzing results, performing RCAs, formulating hypotheses, and conducting ablation or live experiments

Other signals

  • building next-generation AI agents
  • driving improvements through continuous quality iterations
  • delivering robust, production-ready agentic systems
  • leading the engineering of the ML quality feedback loop
  • research topics that maximize scientific and real-world impact
  • push the state-of-the-art in AI