Solutions Architect - Devops

NVIDIA NVIDIA · Semiconductors · Australia · Remote

NVIDIA is seeking a Senior Cloud Infrastructure and DevOps Solutions Architect to manage and optimize large-scale AI/HPC infrastructure, focusing on Kubernetes, automation, monitoring, and customer engagement for AI operational projects.

What you'd actually do

  1. Maintain large scale computational and AI infrastructure, focusing on monitoring, logging, workload orchestration (Kubernetes and Linux job schedulers).
  2. Optimize scalable, production-ready Kubernetes-based container platforms coordinated with enterprise-grade networking and storage.
  3. Serve as a key technical resource, develop, refine, and document standard methodologies and operational guidelines to be shared with internal teams.
  4. Perform end-to-end resolving across the stack, from bare metal and operating system, through the software stack, container platform, networking, and storage.
  5. Support Enterprise, Research & Development activities and engage in POCs/POVs to validate new features, architectures, and upgrade approaches.

Skills

Required

  • Kubernetes
  • HPC/AI clusters
  • Networking fundamentals
  • Linux
  • Python
  • Bash scripting
  • Observability stacks (Grafana, Loki, Prometheus)
  • Configuration management
  • Infrastructure-as-Code tools (Ansible, Terraform)

Nice to have

  • CI/CD pipelines
  • Kubernetes and container-based microservices architectures
  • GPU-focused hardware and software (NVIDIA DGX, CUDA, GPU Operator)
  • RDMA-based fabrics (InfiniBand or RoCE)

What the JD emphasized

  • Extensive experience with Kubernetes for container orchestration
  • Proven understanding of networking fundamentals
  • hands-on experience managing HPC/AI clusters
  • Deep knowledge of Linux
  • Proficiency in Python and Bash scripting
  • Experience with observability stacks

Other signals

  • AI/HPC systems
  • large scale AI Operational projects
  • Kubernetes-based platforms
  • Automation
  • monitoring, logging, workload orchestration