Senior Software Engineer, Ai/ml Training Infrastructure

Google Google · Big Tech · Mountain View, CA +1

The role focuses on building the data and training infrastructure for AI/ML models within Google Search. Responsibilities include scaling distributed systems, optimizing training, and applying research to improve model quality and efficiency. The role requires software development experience, particularly with ML infrastructure.

What you'd actually do

  1. Write and test product or system development code.
  2. Collaborate with peers and stakeholders through design and code reviews to ensure best practices amongst available technologies (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
  3. Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
  4. Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality.
  5. Design and implement solutions in one or more specialized ML areas, leverage ML infrastructure, and demonstrate expertise in a chosen field.

Skills

Required

  • software development
  • programming languages
  • software products
  • software design
  • architecture
  • Speech/audio
  • reinforcement learning
  • ML infrastructure
  • model deployment
  • model evaluation
  • optimization
  • data processing
  • debugging

Nice to have

  • data structures
  • algorithms
  • technical leadership

What the JD emphasized

  • ML infrastructure
  • model deployment
  • model evaluation
  • optimization
  • data processing
  • debugging

Other signals

  • scaling large-scale distributed systems
  • optimizing real-time training capabilities
  • applying the latest research to improve model quality and efficiency