Software Engineer – Infrastructure & Network Automation

NVIDIA NVIDIA · Semiconductors · Yokneam, Israel +1

Software Engineer role focused on designing and developing an automation platform for NVIDIA's Data Center environments, specifically for network solutions validation. The role involves provisioning, configuring, and monitoring HPC data centers, enhancing cluster visibility, and improving operational efficiency through scalable services and code optimizations. It requires collaboration with various stakeholders and troubleshooting in distributed environments.

What you'd actually do

  1. Design and develop an automation platform used to provision, configure, and monitor HPC data centers.
  2. Implement scalable, reliable, and maintainable services that enhance cluster visibility and improve operational efficiency.
  3. Collaborate closely with internal and external stakeholders to understand requirements and deliver robust full-cycle solutions.
  4. Improve stability and performance across the provisioning pipeline through architectural enhancements and code optimizations.
  5. Troubleshoot issues in distributed environments and contribute to system observability and reliability improvements.

Skills

Required

  • B.Sc. in Computer Science, Engineering, or a related field (or equivalent practical experience)
  • 3+ software development experience
  • Familiarity with Networking concept and protocols
  • Scripting and automation skills (Bash, Python, Ansible)
  • Experience with version control systems (Git) and CI/CD pipelines
  • Independent, fast learner with strong ownership mindset
  • Excellent debugging and problem-solving skills
  • Effective communication abilities, Team player, customer oriented

Nice to have

  • Hands-on experience on Linux-based platform
  • Background in DevOps and Network Engineering practices
  • Familiarity with DevOps methodologies and tools (e.g., Jenkins, Ansible)
  • Hands-on experience with large-scale network architectures, switches/routers, OVS, SR-IOV, and network operating/management systems
  • Networking expertise: Ethernet, VLANs, TCP/UDP/IP, QoS, L2/L3 protocols, BGP, EVPN/VXLAN, and common network topologies

What the JD emphasized

  • 3+ software development experience
  • Familiarity with Networking concept and protocols
  • Scripting and automation skills (Bash, Python, Ansible)
  • Experience with version control systems (Git) and CI/CD pipelines
  • Independent, fast learner with strong ownership mindset
  • Excellent debugging and problem-solving skills
  • Effective communication abilities, Team player, customer oriented