Machine Learning Engineer, Alexa AI

Amazon Amazon · Big Tech · Boston, MA · Software Development

Machine Learning Engineer for Alexa AI focused on LLM training, production deployment, and inference optimizations. Will collaborate with Applied Scientists and other MLEs to leverage Amazon's data and computing resources for Generative AI solutions. Responsibilities include investigating design approaches, prototyping, evaluating technical feasibility, processing data, scaling ML models, and delivering high-quality software in an Agile environment. Experience with PyTorch/JAX, vLLM, SGLang, TensorRT, and developing large model hosting platforms is preferred.

What you'd actually do

  1. Will work with other team engineers to investigate design approaches, prototype new technology and evaluate technical feasibility.
  2. Work closely with Applied scientists to process data, scale machine learning models
  3. Will work in an Agile/Scrum environment to deliver high quality software.

Skills

Required

  • 3+ years of non-internship professional software development experience
  • 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
  • Experience working with PyTorch or JAX software
  • Bachelor's degree or foreign equivalent in Computer Science, Engineering, Mathematics, or a related field

Nice to have

  • Experience working with PyTorch or JAX software, or experience with vLLM, SGLang, TensorRT or similar platforms in production environments
  • Experience developing large model hosting platforms, establishing frameworks, and scaling and optimizing inference system.
  • Experience developing and maintaining MLOps tool in large organizations.

What the JD emphasized

  • strong machine learning background
  • delivering new features and products
  • absolute requirements
  • exceptional technical expertise
  • sound understanding of the fundamentals of Computer Science and Machine Learning
  • thrived and succeeded in delivering high quality technology products/services in a hyper-growth environment

Other signals

  • LLM Inference
  • LLM training
  • production deployment
  • optimizations
  • Generative Artificial Intelligence solutions
  • large model hosting platforms
  • scaling and optimizing inference system