Middleware Development Engineer

Intel Intel · Semiconductors · Oregon, Hillsboro, United States +2

This role focuses on optimizing communication libraries (oneCCL, SHMEM, MPI) for High-Performance Computing (HPC) and AI workloads, specifically identifying and resolving performance bottlenecks in Intel's oneCCL library for AI applications. The engineer will optimize runtime software for distributed systems, ensuring low latency and high bandwidth on Intel GPUs and CPUs, contributing to advancements in scientific discovery and AI systems.

What you'd actually do

  1. Identify performance bottlenecks and additional features necessary to run Argonne AI COE workloads.
  2. Optimize runtime software for distributed computing systems, ensuring optimal latency and bandwidth.
  3. Collaborate with cross-functional teams to define technical specifications and software requirements.
  4. Troubleshoot and resolve complex issues across multiple hardware and software stack layers.
  5. Contribute to software innovations that enhance HPC and AI capabilities at unprecedented scale.

Skills

Required

  • Bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, Mathematics, or STEM-related field with 3+ yrs. of experience in software development OR Master's degree in Computer Science, Computer Engineering, Electrical Engineering, Mathematics, or STEM-related field with 1+ yrs. of experience in software development OR Ph.D. in Computer Science, Computer Engineering, Electrical Engineering, Mathematics, or STEM-related field with 3+ months of experience in software development
  • 3+ years of experience in at least one of the following: Distributed computing systems, HPC communication libraries (examples being: MPI, SHMEM, or oneCCL/NCCL), GPU software development, Network communication stack development.

Nice to have

  • Advanced degree (Master's or PhD) in Computer Science, Computer Engineering, Electrical Engineering, Mathematics, or STEM-related field.
  • Proficiency in C and C++ programming.
  • Experience developing in Linux environments.
  • Background in multithreaded programming.
  • Experience in runtime performance optimization, improving communications latency or throughput.
  • Background in developing software for GPUs and collective communication libraries.
  • Strong analytical skills and ability to solve complex software challenges.

What the JD emphasized

  • HPC communication libraries (examples being: MPI, SHMEM, or oneCCL/NCCL)
  • GPU software development
  • runtime performance optimization, improving communications latency or throughput
  • developing software for GPUs and collective communication libraries

Other signals

  • optimize runtime software for distributed computing systems
  • identify performance bottlenecks and additional features necessary to run Argonne AI COE workloads
  • optimize performance across Intel's cutting-edge GPUs and CPUs