Manager, System Level Reliability Test

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA

Manager for System Level Reliability Test at NVIDIA, responsible for planning, execution, and management of reliability testing programs for NVIDIA Board and System products. Ensures compliance with industry standards and customer requirements for long-term reliability, quality, and performance. Drives central initiatives, coordinates test methodologies, and develops continuous improvement in reliability processes.

What you'd actually do

  1. Laboratory and People Management: System reliability lab management, maintenance, and expansion. Supervision of lab engineering and technician personnel.
  2. Strategic Leadership: Provide direction for reliability test programs aligned with interpersonal goals; mentor and develop team members.
  3. Program Management: Direct and supervise execution of reliability test plans, encompassing qualification and accelerated life testing.
  4. Standards Compliance: Ensure alignment with standards such as ISO, JEDEC, EIA, IEC, ISTA, NEBS, IEC, and customer-specific reliability standards.
  5. Failure Analysis: Participate in root cause investigations and corrective actions for reliability-related failures.

Skills

Required

  • Bachelor or master’s in electrical or mechanical engineering, or related field.
  • 8+ overall years in electronic product reliability testing
  • 4+ years reliability lab management
  • Proficiency in accelerated Reliability evaluations
  • environmental stress testing
  • failure analysis
  • reliability modeling
  • statistical analysis of electronic products
  • strong leadership
  • project management

Nice to have

  • Experience with various environmental and mechanical test equipment.
  • Experience in crafting reliability testing scripts based on test profiles provided by customers for automotive, servers and or embedded products
  • Knowledge of reliability predictions principles.

What the JD emphasized

  • reliability testing
  • reliability lab management
  • reliability modeling
  • statistical analysis