Senior Software Engineer – Infrastructure & Network Automation

NVIDIA NVIDIA · Semiconductors · Germany +5 · Remote

Senior Software Engineer role focused on designing and developing automation platforms for NVIDIA's Data Center environments, involving provisioning, configuration, and monitoring of HPC data centers. Requires strong Linux, scripting, DevOps, and networking expertise.

What you'd actually do

  1. Design and develop an automation platform used to provision, configure, and monitor HPC data centers.
  2. Implement scalable, reliable, and maintainable services that enhance cluster visibility and improve operational efficiency.
  3. Collaborate closely with internal and external stakeholders to understand requirements and deliver robust full-cycle solutions.
  4. Improve stability and performance across the provisioning pipeline through architectural enhancements and code optimizations.
  5. Troubleshoot issues in distributed environments and contribute to system observability and reliability improvements.

Skills

Required

  • Linux-based platforms
  • scripting and automation skills (Bash, Python, Ansible)
  • DevOps and Network Engineering practices
  • large-scale network architectures
  • Ethernet, VLANs, TCP/UDP/IP, QoS, L2/L3 protocols, BGP, EVPN/VXLAN
  • containers and cloud-native technologies (Docker, Kubernetes)
  • version control systems (Git)
  • CI/CD pipelines

Nice to have

  • Team Lead/ Scrum master or similar leadership role
  • planning, tracking, and delivering projects
  • DevOps methodologies and tools (e.g., Jenkins, Ansible)
  • Docker and containerized environments
  • agentic AI development