Research Scientist in Large Language Model (LLM) - Seed

ByteDance · Big Tech · San Jose, CA · Algorithm

Research Scientist role focused on advancing next-generation LLMs, including pre-training, post-training, inference, and interpretability. The role involves exploring large-scale models, optimizing systems, data construction, instruction tuning, preference alignment, and improving model capabilities like reasoning and code generation.

What you'd actually do

  1. Explore large-scale models and optimize systems.
  2. Work on data construction, instruction tuning, preference alignment, and model optimization.
  3. Improve model capabilities such as reasoning, code, and math.
  4. Conduct in-depth research and exploration of future use cases.

Skills

Required

  • Excellent coding ability, data structures, and fundamental algorithm skills; proficient in C/C++ or Python
  • Excellent problem analysis and solving skills; able to solve problems in large-scale model training and application
  • Good communication and collaboration skills; able to explore new technologies with the team and promote technological progress

Nice to have

  • Experience with large-scale model training, deep learning, and machine learning algorithms
  • Having led influential projects or authored influential papers in the field of large-scale models

What the JD emphasized

  • large-scale model training
  • deep learning
  • machine learning algorithms
  • influential projects or papers in the field of large-scale models

Other signals

  • advancing the next generation of LLMs
  • solving fundamental challenges in LLM development
  • model pre-training, post-training, inference, memory capabilities, learning, interpretability
  • explore methods to enhance applications through technological innovation