Software Development Engineer Ii, Skg Team

Amazon Amazon · Big Tech · Seattle, WA · Software Development

Software Development Engineer II on the SKG team within AWS Infrastructure Services, focusing on building intelligent systems to automate data center alarm triage and analysis using generative AI. The role involves integrating with services like Amazon Bedrock for summarization and insight generation, and contributing to an automated triage and ticketing engine. This is an applied engineering role focused on integrating and orchestrating AI services, not building or training models, with a strong emphasis on end-to-end feature ownership, design, deployment, and operation in a production environment.

What you'd actually do

  1. Build the services in the Data Center Alarming Platform that turn an alarm firehose into actionable signal: suppression, correlation, and deduplication logic running across hundreds of data centers. You'll own features end-to-end and make the implementation decisions yourself
  2. Build integrations with Amazon Bedrock for natural-language alarm summarization, pattern detection, and triage assistance. This is an applied engineering role — you'll integrate and orchestrate AI services, not build or train models. Expect to prototype, measure, and iterate
  3. Contribute to the Automated Triage and Ticketing Engine: services that classify, prioritize, and route alarms to the appropriate data center engineering operations teams with contextual information for faster resolution
  4. Define infrastructure as code using AWS CDK. You'll write the constructs, wire up the pipelines, and own the operational health of what you deploy
  5. Participate in design reviews — both authoring designs for your own work and providing meaningful feedback on others'. You're expected to bring your own proposals and trade-off analysis

Skills

Required

  • TypeScript
  • Python
  • AWS CDK
  • Lambda
  • DynamoDB
  • SQS
  • SNS
  • S3
  • CloudWatch
  • Amazon Bedrock
  • API Gateway
  • Route 53
  • CI/CD
  • design, code, test, deploy, operate
  • mentoring junior engineers

Nice to have

  • experience with data center operations

What the JD emphasized

  • build integrations with Amazon Bedrock
  • integrate and orchestrate AI services
  • prototype, measure, and iterate
  • own features end-to-end
  • deploy to production multiple times per week

Other signals

  • integrating and orchestrating AI services
  • applying generative AI to surface actionable insights
  • natural-language alarm summarization
  • prototype, measure, and iterate