Currently tracking 489 active AI roles, up 170% versus the prior 4 weeks. Primary focus: Agent · Engineering. Salary range $98k–$505k (avg $233k).
| Title | Stage | AI score |
|---|---|---|
| Senior Software Engineering Manager, Emergent AI Infrastructure Senior Software Engineering Manager for Emergent AI Infrastructure, leading teams to build next-generation on-prem AI infrastructure for frontier models and AI solutions. Role involves technical leadership, team management, strategic guidance, and cross-functional collaboration across global sites to deliver high-impact AI infrastructure products. | Serve | 9 |
| Software Engineer III, AI/ML Cloud Software Engineer III, AI/ML Cloud role at Google, focusing on building and deploying scalable AI/ML platform capabilities for Google Cloud customers. The role involves engineering solutions for LLM serving and tuning, ensuring reliability, performance, and quality, and implementing guardrails and safety gates for the Vertex AI organization. It requires experience in ML infrastructure, model deployment, and potentially speech/audio or reinforcement learning. | Serve |
| 8 |
| Staff Software Engineer, Vector Search, Vertex AI Staff Software Engineer role focused on Vector Search Serving infrastructure for Google Cloud's Vertex AI platform, requiring experience with large-scale distributed systems and GenAI concepts. | ServeAgent | 7 |
| Software Engineer III, Full Stack, AI Development Tools Software Engineer III role focused on developing GenAI and Vertex-based development tools, including server and middleware code that interacts with LLMs. The role involves integrating next-generation LLMs with a focus on performance and deployment constraints, working closely with AI researchers and product managers. | Serve | 7 |
| Technical Lead, Vector Search, Vertex AI Technical Lead for Vector Search Serving infrastructure within Google Cloud's Vertex AI platform, focusing on managing massive datasets and query volumes with low latency. The role involves technical leadership, architecture, code reviews, and mentoring engineers, with a requirement for experience in GenAI techniques. | ServeAgent | 7 |
| Senior Software Engineering Manager, Flume ML Seeking an Engineering Manager with an infrastructure technical background to lead a team focused on optimizing planet-scale data processing infrastructure. This platform powers foundational AI/ML features and next-generation AI initiatives, with a focus on advancing scheduleability and auto-tuning for ML pipelines and standard data workloads. The role involves leading research into AI-driven optimization strategies and collaborating with teams working on frontier AI models. | Serve | 7 |
| Senior Software Engineer, AI/ML Infrastructure Senior Software Engineer focused on optimizing AI/ML infrastructure performance across the technical stack, from networking and data storage to ML models, to provide AI developers with a high-performance experience on Google's AI infrastructure. Responsibilities include designing and implementing solutions, optimizing performance, profiling, debugging, and developing tools for AI/ML infrastructure. | ServeData | 7 |
| Software Engineer, Early Careers, PhD, Cloud AI Software Engineer role focused on implementing and designing Search features, leveraging advanced GenAI capabilities and LLM concepts, with a focus on Kubernetes-based infrastructure for scalability and performance. | Serve | 7 |
| Staff Software Engineer, Load Balancing and Agent Gateway Staff Software Engineer role focused on designing and leading feature development for the GKE Agent Gateway, enhancing capabilities for AI/ML workloads and contributing to the Kubernetes open-source project. The role involves collaboration with Load Balancing teams, ensuring optimal routing, load balancing, and autoscaling for inference serving, and improving observability, testability, and reliability of the Inference Gateway. | Serve | 5 |
| Software Engineer, Performance, Reliability, Observability, PhD, Early Career Software Engineer role focused on performance, reliability, and observability tools for Google Cloud control plane systems. The role involves analyzing VM performance, developing performance models, designing benchmarks, and exploring the use of machine learning for anomaly detection. While the core is engineering and performance analysis, there's an exploration component involving ML for anomaly detection. | Serve | 5 |