Senior Hardware Test Engineer

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA

NVIDIA is seeking a Senior Hardware Test Engineer to build, debug, and manage global HW infrastructure for silicon validation. The role involves overseeing the functionality of a large collection of GPU, SoC, and CPU systems, creating test plans and scripts, and collaborating with global teams to resolve technical challenges. The ideal candidate will have strong expertise in diverse operating systems, particularly Linux internals, and scripting capabilities in Python, Perl, and TCL, along with excellent problem-solving and leadership skills.

What you'd actually do

  1. Oversee the functionality of Hardware Farm, a large collection of GPU, SoC, and CPU systems unified by SW, and perform HW and SW debugs involving PCBs, Voltage Regulators, OS, Drivers, Firmware, etc.
  2. Create test plans and scripts to run sanity checks on farm systems, ensuring HW and SW integrity while working with multiple SW tools involved in automation to identify and fix any infrastructure or script issues
  3. Coordinate with the global farm team and silicon validation teams to ensure the timely availability of farm systems by resolving technical challenges and driving critical debugs by pulling in resources within and outside the team
  4. Collaborate with the Farm-Development team and location-based PIC to enhance the farm process and infrastructure
  5. Apply proactive problem-solving abilities to address issues in the farm infrastructure and drive to closure

Skills

Required

  • B.E/B. Tech in ECE or equivalent experience
  • 5+ years of experience in the HW/Silicon domain
  • In-depth knowledge of various motherboard platforms, SoCs, PC Hardware and components, network connections
  • Strong expertise in diverse operating systems
  • Profound knowledge of Linux OS internals
  • Scripting capabilities (Python, Perl, and TCL)
  • Excellent coordination and interpersonal skills
  • Excellent problem-solving abilities
  • Strong leadership skills

Nice to have

  • AI tools in its recruiting processes

What the JD emphasized

  • critical system debugs
  • HW and SW debugs
  • critical debugs
  • complex hardware
  • complex hardware and software issues