Infrastructure Software Engineer, Deep Learning Libraries

NVIDIA · Semiconductors · Shanghai, China +1

NVIDIA is seeking an Infrastructure Software Engineer to enable high-performing deep learning libraries like TensorRT and TensorRT-LLM. The role involves designing and developing scalable automation for build, test, integration, and release processes across diverse platforms, including autonomous vehicles and datacenters. The engineer will work with industry-standard tools like Kubernetes and Jenkins, and contribute to advancing these tools.

What you'd actually do

  1. Designing and developing software for testing and analysis of our codebases
  2. Building scalable automation for build, test, integration, and release processes for publicly distributed deep learning libraries
  3. Developing throughout the software stack, from the user experience down to the cluster and database layers
  4. Configuring, maintaining, and building upon deployments of industry-standard tools (e.g. Kubernetes, Jenkins, Docker, CMake, GitHub, GitLab, Jira)
  5. Advancing the state of the art in those industry-standard tools

Skills

Required

  • BS or higher degree in Computer Science or Computer Engineering, or equivalent experience
  • 2+ years of relevant experience
  • Strong programming skills in Python (or similar) and familiarity with C/C++ development
  • Experience setting up, maintaining, and automating continuous integration systems (e.g. Jenkins)
  • Fluency in SCM (e.g. Git, Perforce) and build systems (e.g. Make, CMake, Bazel)
  • A pragmatic approach to problem solving and collaboration

Nice to have

  • Experience designing and developing automation in Jenkins with Groovy (or similar)
  • Background with distributed systems and cluster/cloud computing, especially with Kubernetes
  • Experience designing and developing unit and integration test frameworks
  • Hands-on experience with code coverage and static code analysis tools
  • Experience with GPUs, mobile/embedded platforms, and multiple operating systems (Ubuntu, Red Hat, Windows, QNX, L4T, or similar)