Principal Software Development Engineer, Aws Mantle

Amazon Amazon · Big Tech · Seattle, WA · Software Development

Principal Software Development Engineer for AWS Mantle team, focusing on building and scaling a distributed inference engine for foundation models on Amazon Bedrock. The role involves defining technical vision, owning system design, influencing strategy, and ensuring high performance, reliability, and security for millions of customers.

What you'd actually do

  1. Set the long-term technical direction for a globally distributed, high-performance ML inference platform serving models from industry-leading AI providers
  2. Own end-to-end system design decisions that directly impact latency, reliability, and scalability for millions of customers worldwide
  3. Influence engineering strategy across Amazon Bedrock, partnering with senior leadership to align technical investments with business outcomes
  4. Raise the engineering bar through exemplary system design, mentorship, and contributions to the broader AWS engineering community
  5. Navigate complex trade-offs across performance, security, and cost while maintaining the highest standards for operational excellence

Skills

Required

  • 10+ years of non-internship professional software development experience
  • 10+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
  • Bachelor's degree in Computer Science, Engineering, a related field, or equivalent experience
  • 8+ years of programming experience with at least one modern language such as Java, C++, Python, Go, or Rust
  • Experience driving cross-organizational technical strategy and delivering results in complex, ambiguous environments where the business problem and technical approach are not pre-defined

Nice to have

  • Master's degree or equivalent in computer science, machine learning, engineering, or related fields, or PhD
  • Experience building large-scale machine learning and AI solutions at Internet scale
  • Experience working with Advanced Compute technologies including, but not limited to: Accelerated Compute, High Performance Compute, Visual/Spatial Compute, and/or IoT.
  • Experience writing and publishing technical documents or equivalent
  • Familiarity with inference frameworks such as vLLM, TensorRT, or Triton Inference Server

What the JD emphasized

  • millisecond-level latency
  • zero-trust security
  • Zero Operator Access (ZOA) security guarantees
  • global availability and performance SLAs

Other signals

  • Distributed inference engine
  • Serving foundation models
  • High-performance ML systems
  • Scalability and reliability