What you'd actually do

Set the technical vision and roadmap for workload optimization across the AI software stack, ensuring AMD remains the platform of choice for top-tier AI customers.

Lead the profiling, analysis, and tuning of large-scale models (LLMs, Diffusion, Multimodal, and MoE) to ensure "out-of-the-box" performance excellence on AMD hardware.

Partner with top customers and hyperscalers to understand their unique workload requirements and deliver tailored architectural wins and software optimizations.

Collaborate across hardware architecture, compiler, and framework teams to influence future silicon features based on evolving AI workload trends.

Drive the development of advanced tools and frameworks for performance estimation, modeling, and automated reporting.

Skills

Required

15+ years of software development experience with at least 5 years in a high-level technical leadership role (Fellow or equivalent).
Deep expertise in AI Frameworks (PyTorch, JAX, vLLM, SGLang) and the ROCm software stack.
Proven history of optimizing distributed inference and training at scale across multi-node/multi-GPU environments.
Mastery of performance profiling tools (e.g., TorchProfiler, ROCm Profiler, Nsight) and hardware-level performance modeling.
Strong understanding of modern model architectures (Transformer, Attention, KV Cache) and optimization techniques like quantization, speculative decoding, and FlashAttention.
Demonstrated ability to drive cross-functional initiatives in fast-paced, ambiguous environments.
PhD or Master’s degree in Computer Science, Electrical Engineering, or a related field, or equivalent experience.
Demonstrated research or applied experience in AI/ML, including areas such as deep learning, model training/inference optimization, large language models, or computer vision.

Nice to have

AI hardware architecture
software optimization
customer engagement
technical leadership
compiler optimization
framework optimization
performance estimation tools
performance modeling tools
automated reporting tools

WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. **Together, we advance your career. **

Fellow, AI Software (Workload Optimization)

We are looking for a visionary technical leader to join the AI Software group. As a Fellow, you will be accountable for defining and driving the end-to-end software optimization strategy to achieve industry-leading performance for our top-tier customers. You will sit at the intersection of architecture, customer engagement, and software engineering, ensuring that AMD’s software stack—from ROCm and compilers to high-level AI frameworks—is tuned to extract maximum performance for the world's most demanding AI workloads.

THE PERSON

The ideal candidate is a technical powerhouse with a proven track record of solving complex performance bottlenecks at scale. You possess deep knowledge of AI hardware architecture and software optimization, with the ability to map emerging model architectures to low-level software. You are an exceptional communicator who can translate complex technical challenges into strategic roadmaps and influence senior leadership, while also engaging deeply with key customers to solve their most critical performance needs.

KEY RESPONSIBILITIES

Strategic Leadership: Set the technical vision and roadmap for workload optimization across the AI software stack, ensuring AMD remains the platform of choice for top-tier AI customers.
Workload Performance Engineering: Lead the profiling, analysis, and tuning of large-scale models (LLMs, Diffusion, Multimodal, and MoE) to ensure "out-of-the-box" performance excellence on AMD hardware.
Customer Engagement: Partner with top customers and hyperscalers to understand their unique workload requirements and deliver tailored architectural wins and software optimizations.
Hardware-Software Co-design: Collaborate across hardware architecture, compiler, and framework teams to influence future silicon features based on evolving AI workload trends.
Ecosystem Innovation: Drive the development of advanced tools and frameworks for performance estimation, modeling, and automated reporting.
Community & Mentorship: Act as a technical ambassador in industry forums and open-source communities. Mentor and inspire the next generation of AMD’s technical leaders and engineers.

PREFERRED EXPERIENCE

15+ years of software development experience with at least 5 years in a high-level technical leadership role (Fellow or equivalent).
Deep expertise in AI Frameworks (PyTorch, JAX, vLLM, SGLang) and the ROCm software stack.
Proven history of optimizing distributed inference and training at scale across multi-node/multi-GPU environments.
Mastery of performance profiling tools (e.g., TorchProfiler, ROCm Profiler, Nsight) and hardware-level performance modeling.
Strong understanding of modern model architectures (Transformer, Attention, KV Cache) and optimization techniques like quantization, speculative decoding, and FlashAttention.
Demonstrated ability to drive cross-functional initiatives in fast-paced, ambiguous environments.

ACADEMIC CREDENTIALS

PhD or Master’s degree in Computer Science, Electrical Engineering, or a related field, or equivalent experience.
Demonstrated research or applied experience in AI/ML, including areas such as deep learning, model training/inference optimization, large language models, or computer vision.

_Benefits offered are described: _AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here.

_ _

This posting is for an existing vacancy.

WHAT YOU DO AT AMD CHANGES EVERYTHING

Fellow, AI Software (Workload Optimization)

THE PERSON

KEY RESPONSIBILITIES

Strategic Leadership: Set the technical vision and roadmap for workload optimization across the AI software stack, ensuring AMD remains the platform of choice for top-tier AI customers.
Workload Performance Engineering: Lead the profiling, analysis, and tuning of large-scale models (LLMs, Diffusion, Multimodal, and MoE) to ensure "out-of-the-box" performance excellence on AMD hardware.
Customer Engagement: Partner with top customers and hyperscalers to understand their unique workload requirements and deliver tailored architectural wins and software optimizations.
Hardware-Software Co-design: Collaborate across hardware architecture, compiler, and framework teams to influence future silicon features based on evolving AI workload trends.
Ecosystem Innovation: Drive the development of advanced tools and frameworks for performance estimation, modeling, and automated reporting.
Community & Mentorship: Act as a technical ambassador in industry forums and open-source communities. Mentor and inspire the next generation of AMD’s technical leaders and engineers.

PREFERRED EXPERIENCE

15+ years of software development experience with at least 5 years in a high-level technical leadership role (Fellow or equivalent).
Deep expertise in AI Frameworks (PyTorch, JAX, vLLM, SGLang) and the ROCm software stack.
Proven history of optimizing distributed inference and training at scale across multi-node/multi-GPU environments.
Mastery of performance profiling tools (e.g., TorchProfiler, ROCm Profiler, Nsight) and hardware-level performance modeling.
Strong understanding of modern model architectures (Transformer, Attention, KV Cache) and optimization techniques like quantization, speculative decoding, and FlashAttention.
Demonstrated ability to drive cross-functional initiatives in fast-paced, ambiguous environments.

ACADEMIC CREDENTIALS

PhD or Master’s degree in Computer Science, Electrical Engineering, or a related field, or equivalent experience.
Demonstrated research or applied experience in AI/ML, including areas such as deep learning, model training/inference optimization, large language models, or computer vision.

_Benefits offered are described: _AMD benefits at a glance.

AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here.

_ _

This posting is for an existing vacancy.

Fellow, AI Workload Optimization

What you'd actually do

Skills

Required

Nice to have

What the JD emphasized

Other signals

Fellow, AI Software (Workload Optimization)

THE PERSON

KEY RESPONSIBILITIES

PREFERRED EXPERIENCE

ACADEMIC CREDENTIALS

Fellow, AI Software (Workload Optimization)

THE PERSON

KEY RESPONSIBILITIES

PREFERRED EXPERIENCE

ACADEMIC CREDENTIALS