Senior Software Engineering Manager, Dataproc, Google Cloud

Google Google · Big Tech · Sunnyvale, CA +1

Senior Software Engineering Manager for Google Cloud Dataproc, responsible for building an AI-ready platform with GPU support and integrating AI/ML/Data Science workloads. The role involves leading engineering teams, architecting data lakehouse solutions, and fostering innovation in distributed systems and open-source technologies.

What you'd actually do

  1. Provide strategic leadership and goal for a high-performing engineering organization, driving the roadmap for Spark and next-generation Lake House architectures.
  2. Architect global-scale Data Lake and Lake House solutions, integrating open-source standards like Iceberg and Delta Lake with internal BigQuery ecosystems.
  3. Foster a culture of technical innovation and excellence, optimizing open-source stacks for peak performance, security, and enterprise-grade efficiency. Collaborate with cross-functional GCP infrastructure and product teams to deliver seamless, observability-rich platforms for global enterprise customers.
  4. Direct the integration of AI, Machine Learning, and Data Science workloads into the core Dataproc and Lake House offerings. Manage stakeholder relationships and influence internal and external open-source communities to align with Google's strategic objectives.
  5. Oversee organizational health, talent development, and resource allocation across multiple distributed teams to ensure operational excellence and high engagement.

Skills

Required

  • software development
  • technical leadership
  • people management
  • Apache Spark
  • Data Processing
  • Data Analytics

Nice to have

  • Master’s degree or PhD in Engineering, Computer Science, or a related technical field.
  • executive managers
  • Google or other Cloud technologies
  • highly scalable distributed systems
  • open source data analytics technologies such as Apache YARN, Spark, Hive, Flink etc.
  • Data Lakes/Lake houses like Iceberg, Delta, BQ Iceberg tables.

What the JD emphasized

  • native GPU support
  • specialized runtimes (PyTorch, TensorFlow) integrated with Vertex AI
  • AI, Machine Learning, and Data Science workloads

Other signals

  • AI-ready platform with native GPU support
  • specialized runtimes (PyTorch, TensorFlow) integrated with Vertex AI
  • Direct the integration of AI, Machine Learning, and Data Science workloads