Software Engineer Ii, ML Telemetry

Google Google · Big Tech · London, United Kingdom

Software Engineer II, ML Telemetry at Google, London, UK. This role focuses on measuring and reporting the efficiency and adoption of the ML fleet, generating metrics to identify optimization opportunities, and providing data-driven feedback to ML job owners and planners. The role involves working with Core Machine Learning (ML) products and services, and influencing the ML Software/Hardware roadmap.

What you'd actually do

  1. Measure and report the efficiency of the ML fleet, generate and collect metrics that help identify optimization opportunities, and drive improvements via changes to Core Machine Learning (ML) products and services.
  2. Measure and report the fleetwide adoption of Core ML products and services.
  3. Collect metrics to inform the ML Software/Hardware (SW/HW) roadmap.
  4. Provide data-driven actionable feedback to ML job owners, Product Area (PA) resource planners, and Fleet resource planners.

Skills

Required

  • software development
  • data structures
  • algorithms
  • large-scale infrastructure
  • distributed systems

Nice to have

  • Machine Learning
  • JAX
  • PyTorch
  • Compilers
  • Computer Architecture
  • Backend programming
  • C++
  • data analytics
  • FlumeC++
  • SQL
  • instrumentation
  • data aggregation pipelines