Tech Lead - Machine Learning Platform Engineer

ByteDance · Big Tech · San Jose, CA · R&D

The Machine Learning Platform Engineer will develop and maintain a platform that supports deep learning models across code development, testing, training, model deployment, and other core business functions. The platform is foundational for recommendation, advertising, and search systems, and the work involves recommendation systems and distributed training of large-scale deep learning models.

What you'd actually do

  1. Design, develop, and maintain core functions of the machine learning platform, including data processing, feature engineering, model training, model insights, and model deployment.
  2. Build efficient, scalable data processing and computing pipelines to ensure platform stability and performance.
  3. Collaborate with data scientists and machine learning engineers to understand their requirements and provide tooling and technical support.
  4. Work closely with system administrators and software engineers to ensure smooth platform integration with other systems.
  5. Manage and monitor the operation of the machine learning platform and troubleshoot any platform-related issues.

Skills

Required

  • software engineering
  • Python
  • Golang
  • distributed computing
  • large-scale data processing
  • Hadoop
  • Spark
  • communication
  • teamwork
  • analytical skills
  • problem-solving skills

Nice to have

  • machine learning platform engineering

What the JD emphasized

  • At least 3 years of experience in software engineering
  • In-depth understanding of distributed computing and of the principles and technologies of large-scale data processing

Other signals

  • machine learning platform
  • deep learning models
  • model deployment
  • recommendation systems
  • distributed training