Senior Software Engineer – Infrastructure & Network Automation

NVIDIA NVIDIA · Semiconductors · Tel Hai, Israel +1

Senior Software Engineer to join NSV tools (Network Solutions Validation) group, focusing on developing high-performing software automation systems for NVIDIA's Data Center environments. Responsibilities include designing and developing an automation platform for provisioning, configuring, and monitoring HPC data centers, implementing scalable services, and collaborating with stakeholders.

What you'd actually do

  1. Design and develop an automation platform used to provision, configure, and monitor HPC data centers.
  2. Implement scalable, reliable, and maintainable services that enhance cluster visibility and improve operational efficiency.
  3. Collaborate closely with internal and external stakeholders to understand requirements and deliver robust full-cycle solutions.
  4. Improve stability and performance across the provisioning pipeline through architectural enhancements and code optimizations.
  5. Troubleshoot issues in distributed environments and contribute to system observability and reliability improvements.

Skills

Required

  • Linux-based platforms
  • scripting and automation skills (Bash, Python, Ansible)
  • DevOps and Network Engineering practices
  • large-scale network architectures, switches/routers, OVS, SR-IOV, and network operating/management systems
  • Networking expertise: Ethernet, VLANs, TCP/UDP/IP, QoS, L2/L3 protocols, BGP, EVPN/VXLAN, and common network topologies
  • containers and cloud-native technologies (Docker, Kubernetes) and networking performance
  • version control systems (Git) and CI/CD pipelines

Nice to have

  • Team Lead/ Scrum master or similar leadership role
  • planning, tracking, and delivering projects
  • DevOps methodologies and tools (e.g., Jenkins, Ansible)
  • Docker and containerized environments
  • agentic AI development