Machine Learning Engineer - Large Language Models & Generative AI Inference

Apple Apple · Big Tech · Cupertino, CA +1 · Machine Learning and AI

Machine Learning Engineer focused on the inference platform for Large Language Models and Generative AI, working with foundation models and client teams to enhance user experiences across Apple's operating systems. The role involves translating research into high-performing systems and optimizing the model serving stack.

What you'd actually do

  1. Leading the exploration and application of Large Language Models and Generative AI, venturing into new areas within these fields.
  2. Translating the latest research into high-performing systems and a model serving stack that can be practically applied to enhance user experiences.
  3. Collaborating with various teams to develop and implement evolving requirements of our clients on the GenAI inference stack, ensuring performance optimization and alignment with broader business goals.

Skills

Required

  • Machine Learning
  • Large Language Models (LLMs)
  • Generative AI
  • high-performance systems computing
  • model serving stack
  • inference optimization

Nice to have

  • Published research in Machine Learning or AI
  • Advanced degree (Master’s or Ph.D.) in Computer Science, Artificial Intelligence, Machine Learning, or a related field
  • Ongoing professional development in Machine Learning and Artificial Intelligence domains

What the JD emphasized

  • In-depth experience in Machine Learning, with a particular emphasis on Large Language Models (LLMs) and Generative AI.
  • Proven ability to comprehend, interpret, and apply cutting-edge research into tangible applications.

Other signals

  • inference platform
  • large language models
  • generative AI
  • high-performance systems computing
  • model serving stack