Currently tracking 427 active AI roles, up 208% versus the prior 4 weeks. Primary focus: Agent · Engineering. Salary range $65k–$331k (avg $193k).
| Title | Stage | AI score |
|---|---|---|
| Software Engineer Software Engineer role focused on building and scaling the inferencing cloud for Large Language Models and GenAI Services within Azure CoreAI Platform. The role involves designing, building, and operating large-scale engineering systems for AI models. | Serve | 7 |
| Senior Software Engineer Senior Software Engineer role focused on designing, developing, and optimizing Azure's High Performance Computing and AI Platform (HPC/AI) virtual machines. This involves deep technical work on hardware/software interactions, device virtualization, and performance analysis of GPU workloads for large-scale AI training and inference. The role contributes to the underlying platform software and its exposure as an Azure service, with opportunities to work on upper layers of Azure infrastructure. | Serve |
| 7 |
| Senior Software Engineer This role focuses on designing and developing next-generation networking infrastructure for large-scale AI training and inference in Azure Cloud. The engineer will work on high-performance, low-latency, and low-jitter communication frameworks, optimizing scalability and reliability for distributed AI workloads. | Serve | 7 |
| Principal Software Engineer Principal Software Engineer role focused on designing, developing, and optimizing networking infrastructure for large-scale AI training and inference in Azure Cloud. The role emphasizes high performance, low latency, and reliability for distributed AI workloads, working with AI accelerators and advanced networking technologies. | Serve | 7 |
| Senior Software Engineer The role focuses on designing and building cutting-edge networking infrastructure for large-scale AI training and inference in Azure Cloud. The goal is to enable breakthroughs in AI by delivering unmatched computational power, scalability, and reliability, with a focus on high performance, low latency, and minimal jitter for distributed AI workloads. | Serve | 7 |
| Member of Technical Staff - Software Engineer (SuperIntelligence team) This role focuses on building and operating the core platform infrastructure for training, evaluating, and deploying large-scale AI models within Microsoft. It involves designing scalable services for cluster orchestration, job scheduling, data pipelines, and artifact management, with a strong emphasis on production operations, cloud platforms (Azure), and enhancing developer experience for AI research and engineering teams. | Serve | 7 |
| Member of Technical Staff, Hardware Health - MAI Superintelligence Team This role is focused on ensuring the reliability, performance, and availability of Microsoft's large-scale AI training infrastructures, which involve tens of thousands of GPUs and advanced networking. The responsibilities include designing transport, fabric architecture, telemetry, observability, and automated troubleshooting for these clusters. The role also involves AI training and inference cluster bring-up, performance benchmarking, and root-cause analysis, with a goal of developing predictive health models and autonomous remediation systems. | Serve | 7 |
| Technical Program Manager - Infrastructure Technical Program Manager for AI Infrastructure at Microsoft AI, focusing on building and optimizing platforms for large-scale foundation model training, deployment, and serving. The role involves coordinating projects, collaborating with researchers and engineers, and driving progress in a 0->1 environment. | ServePost-train | 7 |