Member of Technical Staff, Reinforcement Learning Systems - MAI Superintelligence Team

Microsoft · Big Tech · Mountain View, CA +4 · Software Engineering

This role focuses on designing, developing, and operating large-scale reinforcement learning systems for training agents and reasoning models. It involves contributing to cutting-edge research and bridging the gap between research and production-grade distributed systems. Responsibilities include tuning pretraining software for specific GPU architectures and contributing to AI model development.

What you'd actually do

  1. Develop and tune scalable pretraining software for NVIDIA GB200 NVL72 CX8 and AMD MIxxx architectures.
  2. Benchmark GB200 and AMD MIxxx GPU clusters.
  3. Gather data and insights to develop the pretraining compute roadmap.
  4. Care deeply about conversational AI and its deployment.
  5. Actively contribute to the development of AI models that power our innovative products.

Skills

Required

  • Bachelor's Degree in Computer Science or related technical field AND 6+ years of technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python.
  • Experience with generative AI.
  • Experience with distributed computing.

Nice to have

  • Master's Degree in Computer Science or related technical field AND 8+ years of technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python, OR Bachelor's Degree in Computer Science or related technical field AND 12+ years of technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python.
  • Experience in leading technical projects and supporting architectural decisions with data.

What the JD emphasized

  • reinforcement learning systems
  • scalable pretraining software
  • AI models
  • generative AI

Other signals

  • large-scale reinforcement learning systems