Technical Solutions Engineer, Ai/ml

Google Google · Big Tech · Pune, Maharashtra, India +1

Technical Solutions Engineer for Google Cloud AI/ML portfolio, focusing on customer-reported issues, deployment failures, and model performance degradation. Responsibilities include troubleshooting, debugging ML models (TensorFlow, PyTorch) in production environments (Kubernetes, Compute Engine), and ensuring production readiness of generative AI models. Requires Python coding, AI/ML concepts, and networking/system administration experience.

What you'd actually do

  1. Troubleshoot and resolve technical issues across the Google Cloud AI/ML portfolio, focusing on customer-reported, deployment failures, model performance degradation and infrastructure-related problems.
  2. Work directly with customers on their ML deployments, including generative AI models to ensure production readiness and high availability.
  3. Utilize coding and scripting skills (primarily Python) to read, debug, and reproduce customer issues within their ML models (TensorFlow, PyTorch) or deployment environments (Kubernetes, Compute Engine).
  4. Manage customer problems through effective diagnosis, clear documentation and the development, implementation of new investigation tools to increase diagnostic speed.
  5. Develop an understanding of Google Cloud's AI/ML solutions and share this knowledge to upskill the wider global support organization.

Skills

Required

  • Python
  • TensorFlow
  • PyTorch
  • Kubernetes
  • Compute Engine
  • AI/ML concepts
  • ML techniques
  • computer networking
  • Linux/Unix system administration

Nice to have

  • Google Cloud certifications
  • Vertex AI
  • Generative AI tools
  • Natural Language Processing (NLP)
  • Computer Vision
  • Recommendation System
  • public cloud infrastructure
  • BigQuery

What the JD emphasized

  • customer-reported
  • deployment failures
  • model performance degradation
  • production readiness
  • high availability
  • debug
  • reproduce customer issues

Other signals

  • customer support
  • ML deployments
  • production readiness
  • debugging ML models