Senior Solutions Architect, Generative AI

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA +2 · Remote

Senior Solutions Architect role focused on customer engagements for NVIDIA's generative AI technologies, involving AI model training and deployment optimization, particularly for LLMs and recommenders in the consumer internet industry. Requires strong coding, GPU optimization, and communication skills.

What you'd actually do

  1. Collaborating closely with customers to improve their workload performance and reduce infrastructure costs.
  2. Leading and developing proof-of-concepts for AI solutions applied to the Consumer Internet industry, including areas like LLMs and recommenders, and building collateral (notebook/code) as needed.
  3. Developing and debugging software for NVIDIA and open-source AI frameworks and libraries.
  4. Partnering with NVIDIA’s software engineering, product, and sales teams to secure design wins and drive the development of innovative solutions based on customer feedback.

Skills

Required

  • Python
  • C++
  • AI software libraries
  • GPUs
  • model training performance optimization
  • inference performance optimization
  • GPU kernels
  • GEMM
  • attention kernels
  • communication skills
  • collaboration

Nice to have

  • Full stack experience
  • DL framework level (PyTorch/JAX)
  • low level (CUDA/CUTLASS/cuDNN/NCCL)
  • enterprise developers
  • customer-facing skills
  • MLOps technologies
  • containers
  • Kubernetes
  • data center deployments
  • large-scale production data pipelines
  • AI model training
  • AI model deployment
  • creative problem-solving

What the JD emphasized

  • proven track record coding in Python and/or C++ with popular AI software libraries and GPUs
  • Experience with profiling and optimizing model training/inference performance on GPUs
  • Experience developing and optimizing GPU kernels for deep learning, with a focus on GEMM and attention kernels

Other signals

  • customer facing
  • performance optimization
  • GPU kernels
  • AI frameworks