What you'd actually do

Own the product roadmap for ROCm’s inference software capabilities, including integrations with key frameworks like PyTorch and JAX, serving runtimes like vLLM and SGLang, and the libraries and profiling tools that make inference workloads perform well on AMD hardware.

Define and communicate a coherent strategy for how AMD software enables production inference workloads, covering the full range from single-GPU to rack-scale deployments, with a focus on developer experience, performance portability, and competitive standing.

Serve as AMD’s active presence in the open-source AI/ML community: monitor GitHub repositories, Discord servers, developer blogs, and academic papers to track emerging trends, pain points, and opportunities.

Partner with engineering leads across inference libraries, kernel, runtime, and serving integration teams to translate developer and customer needs into prioritized engineering work.

Balance the signal from large customers, who may need specific optimizations or bespoke integrations, against the needs of the broader open-source community, who value standards, portability, and low friction.

Skills

Required

Product strategy and roadmap ownership
AI/ML inference infrastructure
GPU computing
Open-source software engagement
Technical fluency
Community engagement
Engineering partnership
Customer and partner engagement
Deep familiarity with PyTorch, JAX, or Triton
Practical understanding of LLM inference
Hands-on familiarity with GPU programming (HIP, CUDA, or Triton kernels)
Strong written communication skills

Nice to have

Experience at a GPU vendor, cloud provider AI infrastructure team, or frontier AI lab
Experience contributing to or managing products in an open-source context

Other signals

driving strategy and execution for ROCm, AMD’s open-source GPU software stack

focus on large-scale model inference on AMD Instinct™ and Radeon™ hardware

intersection of the open-source community, high-performance computing, and production AI deployment

customers include some of the organizations deploying and serving the largest language models in the world

WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. **Together, we advance your career. **

THE ROLE:

AMD is seeking a Senior Product Manager to drive strategy and execution for ROCm, AMD’s open-source GPU software stack, with a specific focus on large-scale model inference on AMD Instinct™ and Radeon™ hardware. This is a key, central role sitting at the intersection of the open-source community, high-performance computing, and production AI deployment.

You will join a small but focused team of product managers whose shared remit is inference at scale and the frameworks that enable it. You will influence engineering roadmaps, represent AMD in open-source communities and at conferences, and connect the needs of AMD’s most strategic customers with the direction of the open-source ecosystem. AMD’s customers include some of the organizations deploying and serving the largest language models in the world.

THE PERSON:

You are a technically fluent product leader with deep roots in GPU computing, open-source software, and AI inference infrastructure. You thrive on moving fast in a community-driven environment while also navigating the nuanced requirements of enterprise-scale customers. You are as comfortable reading a GitHub issue thread as you are presenting a roadmap to a VP of Engineering at a hyperscaler.

KEY RESPONSBILITIES:

Product Strategy & Roadmap** **

Own the product roadmap for ROCm’s inference software capabilities, including integrations with key frameworks like PyTorch and JAX, serving runtimes like vLLM and SGLang, and the libraries and profiling tools that make inference workloads perform well on AMD hardware.
Define and communicate a coherent strategy for how AMD software enables production inference workloads, covering the full range from single-GPU to rack-scale deployments, with a focus on developer experience, performance portability, and competitive standing.
Identify when emerging OSS model architectures, serving runtimes, or inference techniques require an AMD software response and drive those requirements into the roadmap.
Bring a strategic lens to inference by continuously evaluating the latest research, technology trends, and evolving deployment needs, ensuring the organization stays ahead of future inference requirements and translates market signals into actionable product strategy.

Open-Source Community Engagement** **

Serve as AMD’s active presence in the open-source AI/ML community: monitor GitHub repositories, Discord servers, developer blogs, and academic papers to track emerging trends, pain points, and opportunities.
Own and communicate AMD’s open-source software roadmap, publishing updates across key community projects to build awareness, solicit feedback from model developers and researchers, and signal AMD’s long-term direction to the open-source ecosystem.
Build and maintain relationships with key OSS maintainers, foundation working groups, and community contributors whose work shapes how AMD hardware is perceived and adopted.
Represent AMD at major conferences such as SC, PyTorch Conference, and MLSys; write and review technical blog posts and community announcements on the AMD ROCm blog.

Engineering Partnership & Execution** **

Partner with engineering leads across inference libraries, kernel, runtime, and serving integration teams to translate developer and customer needs into prioritized engineering work.
Drive release planning across rapid release cadences with a CI/CD-first mindset and strong attention to compatibility, regression, and ecosystem readiness.
Provide software-informed feedback to hardware teams on future GPU architecture decisions, particularly where features relevant to inference workloads affect serving performance at scale.

Customer & Partner Engagement** **

Balance the signal from large customers, who may need specific optimizations or bespoke integrations, against the needs of the broader open-source community, who value standards, portability, and low friction.
Collaborate with ecosystem partners such as Hugging Face, Red Hat, and cloud providers on joint integrations and AMD-supported software solutions.

PREFERRED EXPERIENCE:

Proven years of product management experience in a software-focused role, ideally developer tools, GPU computing, or AI/ML infrastructure.
Deep familiarity with at least one of PyTorch, JAX, or Triton: how they are structured, how they dispatch work to hardware backends, and what the end-to-end developer experience looks like.
Practical understanding of LLM inference, including attention mechanisms, KV cache design, quantization strategies such, continuous batching, speculative decoding, and parallel serving across multiple GPUs.
Hands-on familiarity with GPU programming at some level, whether HIP, CUDA, or Triton kernels, with the ability to read kernel-level code and benchmark output even if not writing it day to day.
Strong written communication skills across technical documentation, roadmap narratives, community blog posts, and executive-facing strategy documents.
Prior experience at a GPU vendor, cloud provider AI infrastructure team, or frontier AI lab.
Experience contributing to or managing products in an open-source context: GitHub workflows, community governance, and OSS release management.
Contributions in code, issues, or documentation to major open-source inference projects such as vLLM, SGLang, Triton, or PyTorch.
Familiarity with AMD’s hardware portfolio: Instinct MI-series for data center compute and Radeon for workstation and consumer GPU compute.

ACADEMIC CREDENTIALS:** **

Bachelor’s degree in Computer Science, Electrical Engineering, or a related technical field. Advanced degree a plus but not required given equivalent experience.

LOCATION:

Bay Area · Austin · Remote

This role is not eligible for visa sponsorship.

#LI-EV1

#LI-HYBRID

_Benefits offered are described: _AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here.

_ _

This posting is for an existing vacancy.