Senior Software Development Engineer, Skg Team

Amazon Amazon · Big Tech · Seattle, WA · Software Development

Senior Software Development Engineer to build intelligent systems for AWS Infrastructure Services using Generative AI to automate alarm triage and analysis in data centers. This role involves integrating AI services like Amazon Bedrock, designing and building new platforms, and providing technical leadership to a team of engineers. The focus is on applying AI to solve operational problems, not on core AI research or model training.

What you'd actually do

  1. Design and build the Alarm Notification Reduction Platform: systems that intelligently suppress, correlate, and deduplicate millions of alarms across hundreds of data centers, turning a firehose into a focused, actionable signal stream for operations engineers
  2. Apply Amazon Bedrock to historically intractable operational problems: natural-language summaries of complex alarm sequences, automated root cause hypotheses, and pattern detection across alarm data. This is Invent and Simplify in practice. This is an applied engineering role — you'll be integrating and orchestrating AI services, not building or training models
  3. Own the Automated Triage and Ticketing Engine: systems that automatically classify, prioritize, and route alarms to the appropriate data center engineering operations teams with contextual information for faster resolution
  4. Define and evolve cloud infrastructure using AWS CDK, ensuring deployment pipelines are robust, repeatable, and secure across multiple regions
  5. Drive architectural decisions for next-generation systems, evaluate build-vs-integrate tradeoffs, define API contracts with partner teams, and author design documents that set multi-quarter direction

Skills

Required

  • Software design and architecture
  • Cloud infrastructure development (AWS CDK)
  • Integration of AI services (Amazon Bedrock)
  • System design and technical leadership
  • Mentoring junior engineers
  • TypeScript
  • Python

Nice to have

  • Experience with CI/CD pipelines
  • Familiarity with operational data and alarm systems

What the JD emphasized

  • Generative AI hasn't been productionized yet
  • shapes the direction
  • gets GenAI into production
  • build something from a position of real influence
  • Strategy is still being defined
  • you'll build one
  • Input data quality from upstream systems is rough and ripe for optimization
  • GenAI is unproven here
  • not building or training models
  • Actively mentor the team of 8 SDE1s and SDE2s, lead design reviews, raise the bar on engineering practices, and foster a culture where engineers grow in both skill and ownership. This is a core requirement, not a nice-to-have

Other signals

  • Generative AI hasn't been productionized yet.
  • The opportunity is to be the person who shapes the direction, levels up the team, and gets GenAI into production for real data center operations problems.
  • This is an applied engineering role — you'll be integrating and orchestrating AI services, not building or training models