Machine Learning Engineer, AI Coding Tools

ByteDance · Big Tech · San Jose, CA · R&D

Machine Learning Engineer role on TRAE, an AI coding tool that acts as an intelligent engineer for software development. The role covers training, optimizing, quantizing, and deploying LLMs for code generation and reasoning, and spans both the training and deployment pipelines.

What you'd actually do

  1. Design, train, and fine-tune large language models (LLMs) that support TRAE’s core reasoning and code generation capabilities.
  2. Build efficient, stable, and scalable model training and evaluation pipelines.
  3. Collaborate with infrastructure and product teams to deploy and monitor models efficiently on GPU clusters.
  4. Continuously optimize models for latency, throughput, and accuracy.
  5. Stay up to date with and apply cutting-edge techniques in large model optimization and inference acceleration.

Skills

Required

  • PyTorch or TensorFlow
  • Python
  • C++/CUDA
  • large-scale distributed training
  • mixed-precision training
  • model parallelism
  • model quantization
  • pruning
  • CUDA-based deployment optimization

Nice to have

  • PhD in an AI-related research field
  • experience developing and deploying AI systems in a real-world setting

What the JD emphasized

  • end-to-end software generation capabilities
  • intelligent engineering capabilities
  • LLM training and deployment for software engineering
  • production-ready AI systems
