Tech Lead Manager, Freshness and Factuality, Geminiapp, Deepmind

Google Google · Big Tech · Zürich, Switzerland

Tech Lead Manager for GeminiApp, DeepMind, focusing on measuring the intelligence of AI agents through testing systems, developing test problems, and evaluating agent performance. The role involves leading a team of software engineers to ensure freshness and factuality in Gemini applications, driving backend improvements, and productionizing solutions. It requires a strong software engineering foundation, technical leadership, and experience with deep learning/machine learning, with a focus on offline and online evaluation pipelines and agent testing.

What you'd actually do

  1. Develop the mid-term technical vision and roadmap within the scope of your (often multiple) team(s). Evolve the roadmap to meet anticipated future requirements and infrastructure needs.
  2. Review code developed by other engineers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
  3. Provide qualitative & quantitative analysis of different parts of the ecosystem.
  4. Perform in-depth loss analysis, develop/improve offline evaluation pipelines and collaborate with research partners to upstream fundamental model losses.
  5. Research, propose and develop innovative backend solutions addressing quality gaps and misalignments, performing and analysing online experiments and productionizing proposed improvements in the C++ backend.

Skills

Required

  • software development
  • Python
  • C++
  • technical leadership
  • overseeing projects

Nice to have

  • deep learning/machine learning
  • complex, matrixed organization

What the JD emphasized

  • measuring the intelligence of our prototypes
  • agent testing
  • developing test problems
  • test new algorithms on robots
  • Freshness and Factuality are critical areas
  • quality gaps and misalignments
  • offline evaluation pipelines
  • online experiments

Other signals

  • measuring intelligence of prototypes
  • agent testing
  • developing test problems
  • testing new algorithms on robots
  • freshness and factuality
  • quality, backend improvements
  • alignment with product needs
  • delivering reliable and personalized experiences
  • offline evaluation pipelines
  • online experiments
  • productionizing proposed improvements