Research Scientist, Applied GAI-Vision

ByteDance · Big Tech · San Jose, CA · R&D

Research Scientist role focused on applied research in Generative AI and Computer Vision/Multimodal Understanding, with the goal of delivering intelligent solutions to ByteDance products. The role involves conducting cutting-edge research, transferring advanced technologies, and exploring new AI-centric products, with a focus on generative models for content creation, image/video synthesis, editing, and virtual humans.

What you'd actually do

  1. Conduct cutting-edge research and development in computer vision, generative AI and other related fields;
  2. Transfer advanced technologies to ByteDance products;
  3. Explore new products with artificial intelligence technology at its core.

Skills

Required

  • Computer Vision
  • Generative AI
  • Generative models
  • Diffusion models
  • GAN
  • Image and video synthesis
  • Image and video manipulation
  • Image and video editing
  • Image and video understanding
  • Multi-modality
  • Vision and language
  • Generative human
  • Neural motion synthesis
  • Human modeling
  • 2D and 3D pose estimation
  • Hand pose estimation
  • Human shape estimation
  • Facial analysis
  • Algorithms
  • Programming
  • C++
  • Python

Nice to have

  • Publications in CVPR, ICCV, ECCV, SIGGRAPH, NeurIPS

What the JD emphasized

  • 2+ years of research and practical experience in one or more areas of computer vision
  • Generative models
  • Diffusion models
  • GAN
  • Image and video synthesis/manipulation/editing
  • Image and video understanding
  • Multi-modality
  • Vision and language
  • Generative human
  • Neural motion synthesis
  • Human modeling (2D and 3D pose estimation, hand pose estimation, human shape estimation)
  • Facial analysis
  • Strong coding skills in C/C++ and Python

Other signals

  • applied research in Generative AI and CV/Multimodal Understanding
  • delivering intelligent solutions to ByteDance products
  • generative models for content creation, image generation, video synthesis, intelligent image/video editing, and virtual humans