Software Engineer, System Validation - EDA Infrastructure

NVIDIA · Semiconductors · Santa Clara, CA +4 · Remote

Software Engineer focused on scaling the introduction of next-generation architecture into NVIDIA's EDA Infrastructure. The work involves new product introductions (NPIs), distributed systems, software testing, deployment, and hybrid cloud/on-prem environments. The role requires leading the architecture, design, and implementation of EDA clusters, translating requirements into roadmaps, and ensuring seamless integration from hardware to workloads.

What you'd actually do

  1. Lead technical activities for new product introductions at scale, with a focus on hybrid deployments between cloud and on-prem
  2. Provide expertise in new product introduction planning spanning infrastructure workflows, hardware, software release, workload orchestration and application tuning
  3. Provide fast and creative solutions for complex problems and write effective, clear and reliable architecture specifications
  4. Translate requirements to vision, architecture and roadmap
  5. Work with engineering teams across NVIDIA to ensure new product architectures integrate seamlessly from the hardware all the way up to workloads

Skills

Required

  • 12+ years of overall experience
  • BS degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics) or equivalent experience
  • A proven track record of impactful project deliveries focused on cloud infrastructure or cloud application services
  • Experience with at-scale infrastructure, DevOps and/or SRE practices, and/or Platform Engineering
  • Systematic problem-solving approach
  • Strong communication skills
  • Sense of ownership and drive
  • System-level experience with both hardware and software
  • Motivated self-starter
  • Strong problem-solving skills
  • Customer-facing communication skills
  • Passion for continuous learning and knowledge transfer
  • Ability to work concurrently with multiple groups locally and abroad in the organization

Nice to have

  • Developing ML/AI infrastructure
  • Developing bare metal as a service (BMaaS) associated systems
  • Developing multi-cloud infrastructure services
  • Interest in crafting, analyzing and fixing large-scale distributed systems

What the JD emphasized

  • new product introductions at scale
  • cloud infrastructure
  • DevOps and/or SRE practices and/or Platform Engineering
  • System-level experience with both hardware and software