Software Development Engineer, Alexa AI

Amazon Amazon · Big Tech · Bellevue, WA · Software Development

Software Development Engineer to design, build, and operate production systems for Alexa's AI runtime, focusing on agent communication platforms, intelligent request routing, and unified agent ingress. The role involves building infrastructure for AI agents and transforming LLM outputs into real-time responses.

What you'd actually do

  1. Design, implement, and operate distributed systems that power Alexa's AI runtime at scale, including response generation, agent communication, and AI service infrastructure
  2. Contribute to the design and implementation of agent platform primitives: SDKs, agent registries, and communication frameworks used by multiple partner teams
  3. Build and maintain production-grade services with strong operational health — monitoring, alerting, runbooks, and automated remediation
  4. Collaborate with partner teams across the organization to define API contracts, align on system architecture, and unblock cross-team dependencies
  5. Participate in on-call rotations and contribute to operational excellence, including error rate reduction, incident response, and runbook automation

Skills

Required

  • 3+ years of non-internship professional software development experience
  • 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
  • Bachelor's degree or foreign equivalent in Computer Science, Engineering, Mathematics, or a related field
  • Experience programming with at least one software programming language

Nice to have

  • Master's degree or equivalent
  • Experience designing, building, operating, and managing large-scale distributed systems or web services
  • Experience in machine learning, data mining, information retrieval, statistics or natural language processing, or experience in developing and deploying LLMs in production on GPUs, Neuron, TPU or other AI acceleration hardware
  • Experience or certifications in API design, cloud architecture/deployment, service-oriented architecture, mobile development, performance optimization, databases design and related fields
  • Experience with streaming systems, real-time data pipelines, or event-driven architectures (e.g., Kinesis, EventBridge, Kafka, or equivalent)
  • Familiarity with agentic frameworks, agent communication protocols, or multi-agent orchestration systems
  • Strong operational instincts — experience owning production systems on-call, reducing error rates, and building runbook automation

What the JD emphasized

  • agent communication
  • AI runtime
  • agent platform
  • LLMs in production

Other signals

  • Agent communication platform
  • Intelligent request routing
  • Unified agent ingress and event delivery
  • Agent platform primitives
  • LLMs in production