Research Scientist in Large Language Model (llm)-seed

ByteDance · Big Tech · San Jose, CA · Algorithm

Research Scientist role focused on advancing next-generation LLMs, including pre-training, post-training, inference, and interpretability. The role involves exploring large-scale models, optimizing systems, data construction, instruction tuning, preference alignment, and improving model capabilities like reasoning and code generation.

What you'd actually do

Explore large-scale models and optimize systems;
Data construction, instruction tuning, preference alignment, and model optimization;
Improving relevant model capabilities, such as reasoning, code, math etc;
In-depth research and exploration of future use cases;

Skills

Required

Excellent coding ability
data structures
fundamental algorithm skills
proficient in C/C++ or Python
Excellent problem analysis and solving skills
able to solve problems in large-scale model training and application
Good communication and collaboration skills
able to explore new technologies with the team and promote technological progress

Nice to have

Experience with large-scale model training
deep learning
machine learning algorithms
Candidates who have led influential projects or papers in the field of large-scale models

What the JD emphasized

large-scale model training
deep learning
machine learning algorithms
influential projects or papers in the field of large-scale models

Other signals

advancing the next generation of LLMs
solving fundamental challenges in LLM development
model pre-training, post-training, inference, memory capabilities, learning, interpretability
explore methods to enhance applications through technological innovation

Read full job description

Team Info: The Seed LLM team is dedicated to advancing the next generation of LLMs and solving, fundamental challenges in LLM development. Our areas of focus include model pre-training, post-training, inference, memory capabilities, learning, interpretability and other related directions. We dive deep into the latest technologies and create comprehensive solutions from concept to completion. In our endeavor to adopt LLMs in real-life scenarios, we explore methods to enhance applications through technological innovation.

The main job directions include:

Explore large-scale models and optimize systems;
Data construction, instruction tuning, preference alignment, and model optimization;
Improving relevant model capabilities, such as reasoning, code, math etc;
In-depth research and exploration of future use cases;

Requirements

Minimum Qualifications

Excellent coding ability, data structures, and fundamental algorithm skills, proficient in C/C++ or Python;
Excellent problem analysis and solving skills, able to solve problems in large-scale model training and application;
Good communication and collaboration skills, able to explore new technologies with the team and promote technological progress.

Preferred Qualifications:

Experience with large-scale model training, deep learning and machine learning algorithms are preferred;
Candidates who have led influential projects or papers in the field of large-scale models are preferred;