Senior Software Engineer

Microsoft · Big Tech · United States · Software Engineering

The role focuses on designing and building cutting-edge networking infrastructure for large-scale AI training and inference in Azure Cloud. The goal is to enable breakthroughs in AI by delivering unmatched computational power, scalability, and reliability, with a focus on high performance, low latency, and minimal jitter for distributed AI workloads.

What you'd actually do

  1. Lead discussions and own the architecture of products and solutions, developing complex design specifications and testing design options to ensure performance, scalability, resilience, and security.
  2. Independently create clear test strategies that ensure solution quality, incorporating security testing and using AI tools for test automation.
  3. Produce extensible, maintainable, well-tested, secure, and performant code that adheres to design specifications, and continuously improve code quality and stability using metrics and AI techniques.
  4. Review product and test code for compliance with standards, adequate test coverage, and adherence to best practices, coaching others through code reviews.
  5. Independently use AI tools and practices throughout the software development lifecycle, taking responsibility for AI-generated assets and improving how AI is integrated into processes.

Skills

Required

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • Ability to meet Microsoft, customer and/or government security screening requirements

Nice to have

  • Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python; OR Bachelor's Degree in Computer Science or related technical field AND 8+ years of the same experience; OR equivalent experience
  • 1+ years experience with any of the following: High Performance Networking, InfiniBand, RoCE, CUDA

What the JD emphasized

  • high performance
  • low latency
  • scalability
  • reliability
  • observability
  • security

Other signals

  • AI supercomputer
  • large-scale AI training
  • distributed AI workloads
  • next-generation networking capabilities
  • low-latency systems
  • AI infrastructure