Principal Applied Research Scientist

AMD AMD · Semiconductors · Bellevue, WA +1 · Engineering

Research Scientist at AMD focused on training large language models (LLMs) and large multimodal models (LMMs). The role involves exploring novel architectures, large-scale training techniques, pre-training, fine-tuning, RL, and alignment. The goal is to advance the state-of-the-art and influence AMD's AI platform direction, with a requirement to publish work.

What you'd actually do

  1. Train, finetune, and RL for LLMs/LMMs.
  2. Improve on the state-of-the-art LLMs/LMMs..
  3. Accelerate the training and inference speed of LLMs/LMMs.
  4. Research novel ML techniques and model architectures.
  5. Influence the direction of AMD AI platform.

Skills

Required

  • PhD degree or equivalent in machine learning, computer science, artificial intelligence, or a related field.
  • Experience in developing and debugging in Python.
  • Experience in ML Framework such as PyTorch, JAX or TensorFlow
  • Experience with distributed training.
  • Expertise on LLM/LMM pretraining, finetuning, and/or RL.
  • Expertise on transformer architecture.

Nice to have

  • Leadership skills to drive sophisticated issues to resolution.
  • Able to communicate effectively and work optimally with different teams across AMD.

What the JD emphasized

  • Strong publication record in top tier conferences and journals.

Other signals

  • training large language models
  • large multimodal models
  • novel LLM/LMM architectures
  • large-scale training techniques
  • advance the state-of-the-arts