Principal System Software Engineer, Networking Linux Kernel - DGX Cloud

NVIDIA · Semiconductors · Tel Aviv, Israel

NVIDIA is seeking a Principal System Software Engineer for its DGX Cloud networking team. This role involves hands-on coding in C for the Linux kernel and userspace, focusing on storage services and networking features for distributed storage solutions. The engineer will research, design, implement, and test new features, analyze and solve complex bugs in large-scale production systems, and work with various teams and architects. The role requires strong C/C++ proficiency, knowledge of networking fundamentals, experience with RDMA technologies, and an understanding of distributed systems and cloud computing concepts.

What you'd actually do

  1. 100% hands-on coding role in C language, kernel and userspace
  2. Research, design, implement, and test new and existing networking features for distributed storage services, as well as features of NVIDIA’s block storage solution, in both host and DPU environments.
  3. Build an understanding of the algorithms, technical details, and interactions with other components across NVIDIA’s block storage ecosystem.
  4. Analyze and solve challenging bugs and customer cases in large-scale production systems, identifying issues in our own or inbox kernel modules and often in other components. Drive new solutions based on the issues that arise.

Skills

Required

  • B.Sc. or M.Sc. in Computer Science, Electrical Engineering, or a related discipline (or equivalent experience).
  • 15+ years of experience as a senior developer, preferably in the domains of storage, networking, and/or operating systems.
  • Strong proficiency in C/C++ programming.
  • Knowledge of networking fundamentals and experience in Linux-based networking environments.
  • Familiarity with RDMA technologies, including InfiniBand, RoCE, or iWARP, and experience with RDMA programming models and control and data paths. Comprehension of large and complex systems.
  • Proven professional experience in designing and developing distributed systems; experience in block storage and/or networking systems is an advantage.
  • Ability to work autonomously, with a proactive mindset and the perseverance to solve day-to-day challenges.
  • Ability to quickly adapt to new technology and go deep into new areas.
  • Excellent communication skills and a collaborative mindset.
  • Innovative approach, identifying opportunities to improve, accelerate, and reuse existing solutions.
  • Knowledge of cloud computing concepts, including virtualization, scalability, and data management.

Nice to have

  • Linux Kernel coding experience.
  • Linux Kernel internals knowledge including memory management, scheduling, etc.

What the JD emphasized

  • 100% hands-on coding role in C language, kernel and userspace
  • 15+ years of experience as a senior developer, preferably in the domains of storage, networking, and/or operating systems.