Deep Learning Performance Architect

NVIDIA NVIDIA · Semiconductors · Shanghai, China

NVIDIA is seeking a Deep Learning Performance Architect to optimize deep learning hardware and software architecture, analyze performance of deep learning algorithms on different architectures, identify bottlenecks, and explore new features and hardware capabilities. Requires a strong background in computer architecture and experience with deep learning platforms and frameworks.

What you'd actually do

  1. Analyze performance of various deep learning algorithms on different architectures
  2. Identify architecture and software performance bottlenecks and propose optimizations
  3. Explore new features and hardware capabilities on deep learning applications

Skills

Required

  • BSc. MS or PhD in relevant discipline (CS, EE, Math, etc.,)
  • Be familiar with GPU or accelerator-based deep learning platform and software stack
  • A strong background in computer architecture
  • Experience on system architecture design and performance optimization
  • Familiar with machine learning and deep learning frameworks

Nice to have

  • 5+ years of working experience in relevant directions will be a plus

What the JD emphasized

  • deep learning hardware and software architecture
  • deep learning algorithms
  • deep learning platform and software stack
  • machine learning and deep learning frameworks

Other signals

  • GPU and system architectures
  • deep learning modelling
  • performance projections
  • analysis and optimization
  • deep learning hardware and software architecture