Senior Solutions Architect, IPP

NVIDIA · Semiconductors · Santa Clara, CA +1 · Remote

This role is for a Senior Solutions Architect within NVIDIA's IPP Cloud Infrastructure Team, focusing on optimizing software development workflows and CI/CD systems for internal NVIDIA teams. The role involves architecting, implementing, and supporting cloud infrastructure, with a focus on AI development and testing systems. While the role interacts with AI development, its core function is infrastructure and CI/CD, not direct AI/ML model development.

What you'd actually do

  1. Serve as a Solutions Architect on the team behind the GPU Private Cloud, which is used by thousands of NVIDIANs globally for interactive development, centralized CI/CD, and QA testing, with an opportunity to positively impact software development teams across NVIDIA.
  2. Evaluate, identify, and develop software solutions to optimize critical software development workflows across various organizations within NVIDIA.
  3. Architect, implement, and support an end-to-end CI/CD system using open-source and NVIDIA proprietary software.
  4. Onboard customers (internal NVIDIA development teams) to the private cloud infrastructure, starting with a thorough discovery of each use case and the solutions available within the cloud.
  5. Identify performance bottlenecks and optimize the speed and cost efficiency of AI development and testing systems.

Skills

Required

  • BS EE/CS or equivalent experience
  • 12+ years of systems software development
  • At least 1 year dedicated to developing/exploring AI
  • Maintaining cloud infrastructure
  • Maintaining highly available production environments
  • Java
  • Python
  • Shell scripting
  • Distributed systems
  • REST APIs
  • SQL/NoSQL database systems (MySQL, Cassandra, MongoDB, Elasticsearch)
  • Docker containers
  • Virtual Machines
  • OpenStack
  • Kubernetes
  • Chef/Puppet
  • Hadoop/Ceph/SwiftStack
  • LXC
  • Git
  • Perforce
  • JFrog
  • Kafka

Nice to have

  • Depth in AI, Machine Learning and Deep Learning algorithms and techniques
  • Strong collaborative and interpersonal skills
  • Guiding, evangelizing and influencing senior leaders
  • Designing high-performance, scalable software systems
  • Hardware cost optimization

What the JD emphasized

  • BS EE/CS or equivalent experience with 12+ years of systems software development including at least 1 year dedicated to developing/exploring AI.
  • Experience maintaining cloud infrastructure and highly available production environments.
  • Strong programming and software development skills in Java, Python, and shell scripting, along with a good understanding of distributed systems and REST APIs.
  • Experience working with SQL/NoSQL database systems such as MySQL, Cassandra, MongoDB, or Elasticsearch.
  • Excellent knowledge and working experience with Docker containers and Virtual Machines.
  • Good background in cloud technologies such as OpenStack, Docker, Kubernetes, Chef/Puppet, Hadoop/Ceph/SwiftStack, LXC, Git, Perforce, JFrog, and Kafka.