Software Engineer Iii, Speech Production, Infrastructure

Google Google · Big Tech · Mountain View, CA +1

The Software Engineer III, Speech Production, Infrastructure role at Google focuses on building and maintaining the serving infrastructure for Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) at scale. This involves writing product or system development code, delivering engineering milestones, profiling and optimizing serving infrastructure for high reliability and efficiency, and triaging production issues. The role requires experience with large-scale infrastructure, distributed systems, and programming languages like C++ and Python, with a preference for knowledge of speech and language technologies.

What you'd actually do

  1. Write product or system development code.
  2. Deliver the engineering milestones for speech production infrastructure, ensuring high reliability and efficiency.
  3. Profile and optimize serving infrastructure to improve utilization and latency, particularly for compute-intensive models.
  4. Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
  5. Triage and debug complex system issues in production, analyzing the impact on service operations to ensure stability for partners like Search, Assistant and YouTube, among others.

Skills

Required

  • C++
  • Python
  • large-scale infrastructure
  • distributed systems
  • networks

Nice to have

  • data structures
  • algorithms
  • accessible technologies
  • Automatic Speech Recognition (ASR)
  • Natural Language Understanding (NLU)
  • Text to speech (TTS)

Other signals

  • serving infrastructure
  • high-throughput, low-latency streams
  • compute-intensive models