Software Engineer, Managed Service for Apache Spark

Google · Big Tech · Kirkland, WA +1

Software Engineer role focused on developing and operating a managed cloud service for Apache Spark, enabling users to run data analytics and machine learning workloads efficiently in the cloud. The role involves designing and implementing features for ease of use, efficiency, high availability, and fault tolerance in cluster orchestration and resource provisioning.

What you'd actually do

  1. Design and implement features that make Dataproc easier to use and the most efficient way to run Spark workloads.
  2. Architect, implement, and operate the core Dataproc Control Plane, focusing on high availability, fault tolerance, and state management for global-scale resource provisioning and cluster orchestration.
  3. Build high-impact customer-facing features that make Dataproc the best place to run Hadoop, Spark, Flink, AI/ML, and newer technologies in the cloud.

Skills

Required

  • software development
  • data structures
  • algorithms

Nice to have

  • Apache Spark
  • Apache YARN
  • Hadoop Distributed File System
  • Compute Engine