Senior AI Hardware Architect

Microsoft Microsoft · Big Tech · Mountain View, CA +2 · Hardware Engineering

Senior AI Hardware Architect role focused on defining and optimizing next-generation AI accelerator platforms and large-scale AI systems. Responsibilities include analytical performance modeling, workload characterization, profiling, and end-to-end performance analysis across GPU and accelerator architectures, working across hardware, software, and system boundaries. The role involves analyzing AI workloads, identifying performance bottlenecks, developing models for new architectural features, and correlating silicon data with models to drive optimizations for performance, efficiency, and TCO. Collaboration with various hardware and software teams is key to shaping future AI accelerator and system architectures.

What you'd actually do

  1. Lead performance analysis, profiling, benchmarking, and analytical modeling across GPU and AI accelerator architectures, identifying bottlenecks, architectural trade-offs, and optimization opportunities across hardware, software, and system layers.
  2. Analyze end-to-end AI workloads and serving systems, including model execution, runtime behavior, memory systems, communication collectives, and workload mapping strategies to understand performance, scalability, efficiency, and cost drivers.
  3. Develop performance, efficiency, and system-level models to evaluate new architectural features, memory and interconnect innovations, collective communication mechanisms, and accelerator design choices, driving perf/W and TCO optimization.
  4. Correlate silicon measurements, software traces, and kernel execution behavior with architectural models and simulators to validate assumptions, improve model fidelity, and guide future architecture decisions.
  5. Drive kernel-level, runtime-level, and system-level performance optimizations across AI training and inference workloads, translating workload insights into actionable hardware and software improvements.

Skills

Required

  • Master's Degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 3+ years technical engineering experience OR Bachelor's Degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 5+ years technical engineering experience OR equivalent experience.
  • Ability to meet Microsoft, customer and/or government security screening requirements.

Nice to have

  • 4+ years of experience in Computer Architecture, AI Systems, or closely related technical domains.
  • MS or PhD in Computer Architecture, Computer Systems, Electrical Engineering, Machine Learning, High-Performance Computing, or a related field.
  • Strong understanding of GPU and AI accelerator architectures, including compute pipelines, memory hierarchies, interconnects, collective communication, and parallel execution models.
  • Experience with analytical performance modeling, architectural simulation, workload characterization, and silicon correlation for accelerator and system design.
  • Expertise in performance profiling, benchmarking, and root-cause analysis.

What the JD emphasized

  • AI accelerator platforms
  • large-scale AI systems
  • analytical performance modeling
  • workload characterization
  • profiling
  • end-to-end performance analysis
  • GPU and accelerator architectures
  • AI training and inference workloads
  • performance, efficiency, and system-level models
  • memory and interconnect innovations
  • collective communication mechanisms
  • silicon measurements
  • software traces
  • kernel execution behavior
  • architectural models
  • simulators
  • hardware and software improvements

Other signals

  • AI accelerator platforms
  • large-scale AI systems
  • analytical performance modeling
  • workload characterization
  • profiling
  • end-to-end performance analysis
  • GPU and accelerator architectures
  • AI training and inference workloads