Manager, Network Simulation and Infrastructure

NVIDIA NVIDIA · Semiconductors · Tel Aviv, Israel

Manager for a team developing a high-fidelity network simulation platform and its on-premise infrastructure. The role involves team leadership, product ownership of the simulation service, optimizing simulation engines, managing compute clusters, and driving DevOps/automation. Collaboration with the AI Agents team is mentioned for API exposure.

What you'd actually do

  1. Manage and mentor a team of C++ software engineers and DevOps infrastructure engineers, fostering a culture of performance, reliability, and code quality.
  2. Treat the internal simulation platform as a product. Work with research partners to define the roadmap, prioritize features, and ensure high availability for users.
  3. Be responsible for the architecture and optimization of complex network simulation engines (C++ based), ensuring they can scale to model extensive data center topologies with high fidelity.
  4. Own the lifecycle of our on-premise compute clusters and servers. Drive decisions on hardware upgrades, prioritisation, and managing system resources.
  5. Lead the strategy for CI/CD pipelines, automated testing, and containerized deployments to ensure rapid iteration and stability of the simulation platform.

Skills

Required

  • MSc, Ph.D. or equivalent experience in Computer Science, Electrical Engineering, or a related field.
  • 8+ years of hands-on software engineering experience
  • 3+ years of managerial experience
  • C++ development for high-performance applications
  • System-level programming
  • concurrent programming
  • managing on-premise servers
  • Linux environments
  • Kubernetes
  • Slurm
  • Docker
  • Ansible
  • ensuring uptime
  • monitoring system health
  • optimizing hardware utilization

Nice to have

  • Networking Knowledge
  • computer networking fundamentals
  • data center architectures
  • discrete event simulation (DES)
  • modeling complex systems
  • MPI
  • CUDA
  • High-Performance Computing frameworks
  • OMNeT++
  • NS-3
  • switch micro-architecture
  • NIC design

What the JD emphasized

  • leading technical teams in systems or infrastructure domains for 3+ years
  • managerial experience
  • C++ Expertise
  • Infrastructure & DevOps
  • Operational Rigor