Software Dev Engineer, Aws Identity Analytics Platform

Amazon Amazon · Big Tech · Seattle, WA · Software Development

Software Development Engineer role focused on building and operating the data platform infrastructure for an AI-driven analytics platform at AWS Identity. This involves designing and managing ingestion, transformation, and serving pipelines for petabyte-scale data to feed ML models and LLM agents. The role also includes productionizing ML models, building feature engineering infrastructure, and ensuring platform resilience and scalability.

What you'd actually do

  1. Design, build, and operate scalable data ingestion, transformation, and loading pipelines that process petabyte-scale Identity logs, metrics, and policy data from IAM, STS, and other AWS Identity services — using services such as AWS Glue, EMR, Spark, Athena, S3, and Redshift.
  2. Own the productionization lifecycle for ML models developed by the Applied Scientist: package, deploy, monitor, and maintain models in production environments using SageMaker, ECS, and EKS — ensuring reliability, latency, and scalability meet production standards.
  3. Build and maintain the feature engineering infrastructure that transforms raw Identity data into structured datasets ready for ML training, evaluation, and inference.
  4. Drive platform resilience and operational excellence — designing for failure, building robust monitoring and alerting, reducing operational load through automation, and ensuring the platform scales automatically to the demands of incoming data.
  5. Partner with the Applied Scientist, BIEs, and product managers to understand analytical requirements, design data models that support both current and future use cases, and ensure the platform evolves ahead of customer needs.

Skills

Required

  • 3+ years of non-internship professional software development experience
  • 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
  • 1+ years of designing and developing large-scale, multi-tiered, multi-threaded, embedded or distributed software applications, tools, systems, and services using: C#, C++, Java, or Perl experience
  • Bachelor's degree or foreign equivalent in Computer Science, Engineering, Mathematics, or a related field

Nice to have

  • 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
  • Bachelor's degree in computer science or equivalent
  • Knowledge of Machine Learning and LLM fundamentals, including transformer architecture, training/inference lifecycles, and optimization techniques
  • Experience developing, deploying and managing AI products at scale

What the JD emphasized

  • petabyte-scale data
  • AI-driven analytics platform
  • ML models
  • LLM agents
  • production-grade systems
  • feature engineering infrastructure

Other signals

  • AI-driven analytics platform
  • ML models, LLM agents
  • petabyte-scale data processing
  • production-grade systems
  • feature engineering infrastructure