Senior Software Engineer, Infrastructure, High Performance Networking

Google Google · Big Tech · New York, NY +2

Senior Software Engineer on the High Performance Networking (HPN) team, focusing on end-to-end RDMA stack ownership. The role involves enhancing network performance for HPC and ML workloads, optimizing RDMA capabilities with next-generation Smart NICs, and performing full-stack optimizations across Google Cloud infrastructure. Responsibilities include writing and testing code, participating in design reviews, code reviews, documentation, and debugging system issues.

What you'd actually do

  1. Write and test product or system development code.
  2. Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies.
  3. Review code developed by other developers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
  4. Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
  5. Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality.

Skills

Required

  • software development
  • programming languages
  • software design and architecture
  • large-scale infrastructure
  • distributed systems
  • networks
  • compute technologies
  • storage
  • hardware architecture

Nice to have

  • data structures and algorithms
  • UNIX/Linux open source developments
  • kernel/device drivers
  • networking
  • systems architecture
  • compilers
  • operating systems
  • modeling and analysis
  • kernel device drivers
  • performance debugging and optimization
  • design of performance tools
  • compiler design and code optimization
  • high-performance software development techniques
  • concurrent programming
  • multi-core computer architectures

What the JD emphasized

  • end-to-end (E2E) ownership of the RDMA (Driver + guest) stack
  • full-stack optimizations
  • HPC and ML workloads
  • RDMA capabilities
  • next-generation Smart NICs