AI Research Scientist, Video Generation and Post Training, Fair

Meta Meta · Big Tech · Bellevue, WA +2

Research Scientist role focused on video generation and post-training of large-scale multimodal models within Meta's Fundamental AI Research (FAIR) team. The role involves developing generative models, optimizing post-training paradigms, and contributing to frontier models for next-generation AI systems, with a focus on video and media generation.

What you'd actually do

  1. Conduct fundamental and applied research in video generation, including generative models, video synthesis, and multimodal learning
  2. Develop and optimize post-training paradigms for large-scale video and multimodal models, improving their performance, robustness, and generalization
  3. Collaborate with teams across Meta to build perceptual foundations for real-time embodied agents and conversational AI
  4. Contribute to the development and deployment of frontier models (e.g., Llama, LMMs) and push the boundaries of video and media generation

Skills

Required

  • video generation
  • computer vision
  • multimodal AI
  • large-scale model training
  • post-training optimization techniques
  • data curation
  • video synthesis
  • multimodal fusion techniques
  • video-language models
  • complex problem solving
  • interdisciplinary team collaboration

Nice to have

  • PhD or equivalent experience
  • expertise in video generation
  • expertise in computer vision
  • expertise in multimodal AI
  • experience with large-scale model training
  • experience with post-training optimization techniques
  • experience with data curation
  • experience with video synthesis
  • experience with multimodal fusion techniques
  • experience with video-language models
  • experience solving complex problems
  • experience working and communicating cross-functionally in a collaborative, interdisciplinary team environment

What the JD emphasized

  • publication record
  • Proven track record of achieving significant results, as demonstrated by grants, fellowships, patents, or publications at leading workshops, journals, or conferences (e.g., NeurIPS, ICML, ICLR, CVPR, ICCV)

Other signals

  • video generation
  • post-training
  • large-scale models
  • multimodal learning
  • frontier models