Senior System Software Engineer - Slurm

NVIDIA · Semiconductors · UT · Remote

Senior System Software Engineer role focused on maintaining, improving, and optimizing Slurm workload manager software written in C. Responsibilities include developing system-level code, debugging complex issues, writing clean code, collaborating with cross-functional teams, and providing technical support. Requires strong C development experience, Linux systems knowledge, debugging skills, and familiarity with containers and GPU technologies.

What you'd actually do

  1. Maintain, improve, and optimize software components in C
  2. Develop and maintain system-level and application-level code for Slurm, including networking, system, and device-level components
  3. Debug and troubleshoot complex Slurm issues related to reliability and performance
  4. Write clean, maintainable, and well-documented code that adheres to industry standards
  5. Collaborate with cross-functional teams including Operations, Infrastructure, and Deployment

Skills

Required

  • C development
  • Linux systems programming
  • Systems administration
  • Technical support
  • Software development
  • System-level troubleshooting
  • Memory management
  • Pointers
  • Data structures
  • Algorithms
  • Debugging tools (GDB)
  • Performance profiling
  • Linux kernel interfaces
  • System calls
  • File system
  • Automake
  • Software development lifecycles
  • Agile methodologies
  • Problem-solving skills
  • Analytical skills
  • Containers
  • GPU technologies

Nice to have

  • C++
  • Low-level languages
  • System administration
  • High Performance Computing (HPC)
  • Slurm Workload Manager
  • HPC scheduling systems
  • Operating system internals
  • Hardware-software interaction
  • Open-source C projects

What the JD emphasized

  • 5+ years of professional experience in C development
  • Strong understanding of memory management, pointers, data structures, and algorithms
  • Solid understanding of Linux kernel interfaces, system calls, and file systems, plus experience working with Automake