Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.
The Kueue Core team is on a mission to make Kubernetes the most powerful and efficient platform for AI/ML. We are the maintainers of Kueue, a key CNCF open-source project and the leading scheduler for AI workloads on Kubernetes. Our daily work involves designing and building sophisticated scheduling solutions to manage thousands of accelerators like TPUs and GPUs across many users. We address the core challenge: given a large, contended pool of resources, how do we schedule a massive volume of jobs fairly and optimally? This involves deep work in algorithm design, system performance, and Kubernetes internals. Our team collaborates extensively with internal partners building Google's AI services, external AI pioneers, and the open-source community, making a tangible impact on the AI landscape.
Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.
Individual pay is determined by factors including job-related skills, experience, and relevant education or training.
Poland: zł206000 - zł212000 (PLN) + 15% bonus target + equity + benefits
Learn more about benefits at Google.
Responsibilities
- Develop and enhance the Kueue open-source project, focusing on core scheduling algorithms, queueing mechanisms, and overall performance.
- Pioneer the use of Agentic AI to assist in Kueue code development and to diagnose complex scheduling issues.
- Implement support for accelerators, ensuring efficient and high-performance scheduling for hardware like TPU7X, TPU8, and Nvidia GB200/GB300, as well as large-scale CPU workloads.
- Innovate on advanced scheduling concepts such as topology-aware scheduling to optimize network locality and elastic workload support for dynamic scaling.
- Integrate Kueue with popular AI/ML frameworks like Pathways and Ray. Engage actively with the Kubernetes and CNCF open-source communities to drive the direction of AI workload scheduling.
Qualifications
Minimum qualifications:
- Bachelor’s degree or equivalent practical experience.
- 1 year of experience with software development in one or more programming languages (e.g., Python, C, C++, Java, JavaScript).
- 1 year of experience building and developing large-scale infrastructure or distributed systems.
- 1 year of experience with data structures and algorithms.
Preferred qualifications:
- 1 year of experience with developing distributed systems, at large scale.
- 1 year of experience with software development in Go.
- Experience with concurrent programming.