Hardware Systems Engineer

Microsoft Microsoft · Big Tech · Redmond, WA +4 · Hardware Engineering

This role is for a Senior Hardware Systems Engineer in Microsoft's Azure Cloud Hardware and Infrastructure Engineering (CHIE) team. The engineer will develop system validation plans for Azure's hardware solutions, architect and develop test and debug frameworks, build test automation, and partner with technology providers. Responsibilities include planning, designing, and executing system validation plans for state-of-the-art HW solutions, driving continuous improvement in testing methodologies, working with OEMs/ODMs, identifying and resolving server subsystem faults, developing automation and tooling, and handling a DevOps role with on-call responsibilities. The role requires a Master's or Bachelor's degree in a relevant engineering field with 2+ years of experience in server systems/platforms development and validation, hardware validation architecture, test infrastructure development, automation, and programming languages like Python/PowerShell. Experience with system-level benchmarking, Windows/Linux internals, hardware/firmware/OS interactions, debugging complex issues, and hardware validation tools is preferred. Experience with AI/ML workloads for benchmarking is also mentioned as a preferred qualification.

What you'd actually do

  1. Plan, design and execute System validation plans, test frameworks for state-of-the-art HW solutions based on CPU/GPU applications to confirm design meets cloud grade quality.
  2. Drive continuous improvement to achieve unified and standard testing, validation and debug methodology – adopt automation, AI Capabilities to drive efficiency and enhance test coverage.
  3. Work with OEMs/ODMs and other system engineers to run system validation, SKU qualification, scale testing, and system debugging.
  4. Hands on validation and debug work with test engineers in the laboratory.
  5. Identify, triage and resolve server subsystem faults; drive end-to-end root cause with cross team partners and implement fixes.

Skills

Required

  • Master's Degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field OR Bachelor's Degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 2+ years technical engineering experience OR equivalent experience.
  • 2+ years of relevant experience in server systems/platforms development and validation for enterprise or cloud market segments.
  • 2+ years of hands-on experience in server hardware validation architecture, developing test infrastructure, writing test cases, automation and executing tests.
  • 2+ years of experience in programing languages such as Python/PowerShell or similar for automation development or integration.

Nice to have

  • Experience developing system‑level benchmarking or validation tools using C/C++ on PC or server platforms.
  • Proven knowledge of Windows and Linux internals, including threading, scheduling, synchronization, and atomic operations.
  • Proven understanding of hardware, firmware, and OS interactions, including CPU/GPU architectures and platform design trade‑offs.
  • Hands‑on experience debugging complex system‑level issues across hardware, firmware, drivers, OS, and thermal behavior.
  • Proficiency with hardware validation and debug tools (e.g., logic analyzers, oscilloscopes, PCIe analyzers).
  • Familiarity with platform technologies such as PCIe, memory subsystems, networking, and power management.
  • Experience with performance benchmarking and data analysis, including industry benchmarks (e.g., SPEC, Linpack, AI/ML workloads) and system‑level insights.

What the JD emphasized

  • 2+ years of relevant experience in server systems/platforms development and validation for enterprise or cloud market segments.
  • 2+ years of hands-on experience in server hardware validation architecture, developing test infrastructure, writing test cases, automation and executing tests.
  • 2+ years of experience in programing languages such as Python/PowerShell or similar for automation development or integration.