Sr. Machine Learning Engineer, Amazon Traffic Engineering

Amazon Amazon · Big Tech · CA, BC +1 · Software Development

This role focuses on building and operating critical infrastructure services for Amazon's retail platform, dealing with massive traffic volumes in real-time using advanced machine learning systems and distributed infrastructure. The core responsibility involves improving operational excellence, performance, and monitoring of these services.

What you'd actually do

  1. Collaborate within a team and across other teams to launch best in class solutions.
  2. Improve operational excellence by driving performance and monitoring features.
  3. Troubleshoot problems, and in turn recommend and implement industry-leading solutions.
  4. Learn and be curious, and in turn teach, mentor and grow other engineers.

Skills

Required

  • 5+ years of programming with at least one software programming language experience
  • 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
  • Experience as a mentor, tech lead or leading an engineering team
  • Experience with Machine Learning and Large Language Model fundamentals, including architecture, training/inference lifecycles, and optimization of model execution

Nice to have

  • 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
  • Knowledge of machine learning model architecture and inference
  • Bachelor's degree in computer science or equivalent

What the JD emphasized

  • Machine Learning and Large Language Model fundamentals, including architecture, training/inference lifecycles, and optimization of model execution
  • Knowledge of machine learning model architecture and inference

Other signals

  • operates at unprecedented scale
  • massive volumes of traffic in real-time
  • advanced machine learning systems
  • distributed infrastructure