Network Engineer, Hpc - Paris Region

Mistral AI Mistral AI · AI Frontier · Paris, France · Engineering & Infra

Mistral AI is seeking an HPC Network Engineer to design, deploy, and optimize high-performance network infrastructures for their HPC clusters and AI workloads. The role involves collaborating with cross-functional teams to ensure seamless integration of networking solutions with compute, storage, and cloud platforms, directly impacting the performance, reliability, and scalability of AI research and production environments. Responsibilities include designing and implementing low-latency network architectures, troubleshooting complex network issues, monitoring performance, and staying updated with emerging HPC networking technologies.

What you'd actually do

  1. Design, implement, and optimize high-performance, low-latency network architectures for HPC environments, including InfiniBand, RoCE, and high-speed Ethernet.
  2. Collaborate with HPC, DevOps, and AI research teams to integrate networking solutions with compute clusters, storage systems, and cloud platforms.
  3. Troubleshoot and resolve complex network issues to minimize downtime and maximize performance.
  4. Follow escalation procedures and ensure solutions are provided in a timely manner. Ensure escalation is progressing accordingly with the given severity.
  5. Monitor network performance, capacity, and security, implementing improvements as needed.

Skills

Required

  • Proficiency in HPC networking protocols (InfiniBand, RoCE, TCP/IP, MPLS).
  • Hands-on experience with network hardware (switches, routers, NICs) from vendors like Mellanox, Cisco, or Arista.
  • Knowledge of network automation tools (Ansible, Python scripting).
  • Familiarity with HPC environments, parallel computing, and distributed systems.
  • Experience with network security best practices.
  • Strong problem-solving and analytical skills.
  • Ability to thrive in a fast-paced, collaborative environment.
  • Excellent communication skills (English required; French is a plus).
  • Teaching and documentation skills to ensure knowledge is archived and distributed to team members.

What the JD emphasized

  • high-performance, low-latency network architectures
  • HPC environments
  • InfiniBand, RoCE, and high-speed Ethernet
  • AI research and production environments
  • complex network issues
  • network performance, capacity, and security