What you'd actually do

Lead validation and quality ownership of AI/ML compute stacks on Ubuntu and Yocto

Define validation strategy, test architecture, and coverage across functional, performance, stress, regression, and scalability testing

Own and drive the defect lifecycle, including triage, root cause analysis, and closure

Validate end‑to‑end AI pipelines, including:
- Model training, conversion, and optimization (e.g., PyTorch → ONNX)
- Kernel execution, memory transfers, and inference accuracy

Define and execute AI benchmarking and profiling strategies for training and inference workloads

Skills

Required

8–12 years of experience in AI/ML software validation or performance engineering
Strong expertise in Python scripting and test automation
Strong ML fundamentals including deep learning and LLMs
Hands-on experience with ROCm validation, performance profiling, and optimization
Experience with HIP, CUDA, OpenCL, and TensorFlow/PyTorch integrations
Proven experience validating end-to-end AI pipelines
Strong Linux expertise (Ubuntu, Yocto)

Nice to have

Robotics platforms
SIL/HIL environments
Real-world deployments
Heterogeneous accelerators

WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming, and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity, and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. **Together, we advance your career.**

**AI/ML Software Use Cases Validation Engineer (Lead)**

About the Role

Join AMD’s PAVS team as a Robotics Use Cases Validation Engineer, where you will validate and execute tests across diverse robotics platforms, variants, and customer use cases for advanced robotics systems spanning software, hardware, and integrated platforms. Your work will ensure safety, reliability, and performance through rigorous testing in simulation, SIL/HIL environments, and real-world deployments.

Job Summary

We are seeking a Lead AI/ML Member of Technical Staff (MTS) with 8–12 years of experience to own and drive validation of AI/ML compute stacks and Physical AI SDKs. This role leads end‑to‑end AI pipeline validation, benchmarking, and performance optimization on Linux platforms, working closely with architecture, compiler, runtime, and hardware teams to deliver production‑quality AI solutions aligned to real customer use cases.

Key Responsibilities

Lead validation and quality ownership of AI/ML compute stacks on Ubuntu and Yocto
Define validation strategy, test architecture, and coverage across functional, performance, stress, regression, and scalability testing
Own and drive the defect lifecycle, including triage, root cause analysis, and closure
Validate end‑to‑end AI pipelines, including:
- Model training, conversion, and optimization (e.g., PyTorch → ONNX)
- Kernel execution, memory transfers, and inference accuracy
Define and execute AI benchmarking and profiling strategies for training and inference workloads
Analyze compute, memory, and latency bottlenecks and drive system‑level and model‑level optimizations
Validate AI frameworks and runtimes (PyTorch, TensorFlow, ONNX Runtime)
Execute and optimize workloads on ROCm/HIP, CUDA, OpenCL, and heterogeneous accelerators
Design and own Python‑based automation frameworks for validation, benchmarking, and reporting
Drive improvements in validation scalability, efficiency, and performance coverage
Collaborate closely with compiler, runtime, driver, and hardware teams
Provide technical leadership and mentorship to senior and junior engineers
Communicate validation status, performance metrics, and quality risks to stakeholders

Required Skills & Qualifications

Technical

8–12 years of experience in AI/ML software validation or performance engineering
Strong expertise in Python scripting and test automation
Strong ML fundamentals including deep learning and LLMs
Hands‑on experience with ROCm validation, performance profiling, and optimization
Experience with HIP, CUDA, OpenCL, and TensorFlow/PyTorch integrations
Proven experience validating end‑to‑end AI pipelines
Strong Linux expertise (Ubuntu, Yocto)

Validation & Process

Performance‑driven validation mindset with focus on production readiness
Ability to lead initiatives independently with strong ownership

Soft Skills

Strong analytical and problem‑solving skills with clear written and verbal communication, and the ability to collaborate effectively with global, cross‑functional teams.

Education

Bachelor’s or Master’s degree in AI/ML, Robotics, Electronics, Computer Science, or a related field.

_Benefits offered are described:_ AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here.

This posting is for an existing vacancy.
