Applied Scientist 3

Adobe Adobe · Enterprise · San Jose, CA

This role focuses on post-training and distillation of large generative AI models for images and videos, aiming to improve quality, efficiency, and deployability. Responsibilities include developing distillation pipelines, refining post-training methods (SFT, DPO, GRPO, reward-based learning), building infrastructure for training workflows, and optimizing models for deployment efficiency. The role collaborates with researchers and engineers to adapt complex models into efficient production versions.

What you'd actually do

  1. Develop and run distillation pipelines to transfer capabilities from large teacher models into smaller, efficient student models.
  2. Carry out and refine post-training methods including supervised fine-tuning (SFT), preference optimization (DPO/GRPO), and reward-based learning.
  3. Build infrastructure and tools for teacher rollout creation, distillation data pipelines, and training workflows.
  4. Carry out experiments aimed at improving model quality, efficiency, and instruction alignment for generative AI models.
  5. Collaborate closely with research scientists to convert research ideas into scalable training pipelines and production-ready implementations.

Skills

Required

  • MS or PHD in Computer science or related field.
  • 2+ years practical experience in related field.
  • Expertise in machine learning algorithms and model distillation techniques
  • Strong programming skills in Python or similar languages
  • Experience with AI model training and optimization

Nice to have

  • Advanced degree or relevant experience in a related field
  • background in large-scale data pipelines
  • knowledge of Adobe products

What the JD emphasized

  • post-training
  • distillation
  • model distillation
  • supervised fine-tuning
  • preference optimization
  • model compression
  • inference performance

Other signals

  • refining and compressing large-scale AI models
  • post-training and distillation of large generative AI models
  • adapting large complex models into efficient production versions
  • optimize models for deployment efficiency