Research Scientist, Applied GAI-Vision

ByteDance · Big Tech · San Jose, CA · R&D

Research Scientist role focused on applied research in Generative AI and Computer Vision/Multimodal Understanding, with the goal of delivering intelligent solutions to ByteDance products. The role involves conducting cutting-edge research, transferring advanced technologies, and exploring new AI-centric products, with a focus on generative models for content creation, image/video synthesis, editing, and virtual humans.

What you'd actually do

Conduct cutting-edge research and development in computer vision, generative AI and other related fields;
Transfer advanced technologies to ByteDance products;
Explore new products with artificial intelligence technology at its core.

Skills

Required

Computer Vision
Generative AI
Generative models
Diffusion models
GAN
Image and video synthesis
Image and video manipulation
Image and video editing
Image and video understanding
Multi-modality
Vision and language
Generative human
Neural motion synthesis
Human modeling
2D and 3D pose estimation
Hand pose estimation
Human shape estimation
Facial analysis
Algorithms
Programming
C++
Python

Nice to have

Publications in CVPR, ICCV, ECCV, SIGGRAPH, NeurIPS

What the JD emphasized

2+ years of research and practical experience in one or more areas of computer vision
Generative models
Diffusion models
GAN
Image and video synthesis/manipulation/editing
Image and video understanding
Multi-modality
Vision and language
Generative human
Neural motion synthesis
Human modeling (2D and 3D pose estimation, hand pose estimation, human shape estimation)
Facial analysis
Strong coding skills in C/C++ and Python

Other signals

applied research in Generative AI and CV/Multimodal Understanding
delivering intelligent solutions to ByteDance products
generative models for content creation, image generation, video synthesis, intelligent image/video editing, and virtual humans

Full job description

Team Intro: The Vision-Applied Research team focuses on applied research in Generative AI and CV/Multimodal Understanding, and delivering intelligent solutions to ByteDance products, enabling users to make and share creative content in a much easier way. The team has research groups dedicated to generative models for content creation, image generation, video synthesis, intelligent image/video editing, and virtual humans.

Responsibilities:
• Conduct cutting-edge research and development in computer vision, generative AI and other related fields;
• Transfer advanced technologies to ByteDance products;
• Explore new products with artificial intelligence technology at its core.

Requirements

Minimum Qualifications:
• 2+ years of research and practical experience in one or more areas of computer vision:
  Generative models, diffusion models, GAN
  Image and video synthesis/manipulation/editing, image and video understanding
  Multi-modality, vision and language
  Generative human, neural motion synthesis, human modeling (2D and 3D pose estimation, hand pose estimation, human shape estimation)
  Facial analysis
• Highly competent in algorithms and programming; strong coding skills in C/C++ and Python.

Preferred Qualifications:
• Candidates with publications in venues such as CVPR, ICCV, ECCV, SIGGRAPH, and NeurIPS.