Skills

Required

PhD degree or equivalent in machine learning, computer science, artificial intelligence, or a related field.
Experience in developing and debugging in Python.
Experience in ML Framework such as PyTorch, JAX or TensorFlow
Experience with distributed training.
Expertise on LLM/LMM pretraining, finetuning, and/or RL.
Expertise on transformer architecture.

Nice to have

Leadership skills to drive sophisticated issues to resolution.
Able to communicate effectively and work optimally with different teams across AMD.

WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. **Together, we advance your career. **

THE ROLE:

We are looking for a Principal Applied Research Scientist who is experienced with training large language models and/or large multimodal models. In this role, you will explore novel LLM/LMM architectures and large-scale training techniques to advance the state-of-the-arts. You will be part of a world-class research team working on pre-training, fine-tuning, RL, and aligning large language and multimodal models, in addition to keeping up-to-date to the latest progress and trends in LLM/LMM and foundation models.

THE PERSON:

The ideal candidate should be passionate about software engineering and possess leadership skills to drive sophisticated issues to resolution. Able to communicate effectively and work optimally with different teams across AMD.

KEY RESPONSIBILITIES:

Train, finetune, and RL for LLMs/LMMs.
Improve on the state-of-the-art LLMs/LMMs..
Accelerate the training and inference speed of LLMs/LMMs.
Research novel ML techniques and model architectures.
Influence the direction of AMD AI platform.
Publish your work at top-tier venues.

PREFERRED EXPERIENCE:

Experience in developing and debugging in Python.
Experience in ML Framework such as PyTorch, JAX or TensorFlow
Experience with distributed training.
Expertise on LLM/LMM pretraining, finetuning, and/or RL.
Expertise on transformer architecture.
Strong publication record in top tier conferences and journals.

ACADEMIC CREDENTIALS:

A PhD degree or equivalent in machine learning, computer science, artificial intelligence, or a related field.

LOCATION:

Bellevue, WA or San Jose, CA preferred (hybrid). May consider other US locations.

#LI-MV1

#HYBRID

_Benefits offered are described: _AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here.

_ _

This posting is for an existing vacancy.