Software Engineer Ii, Search Science Data Infra

Amazon Amazon · Big Tech · Palo Alto, CA · Software Development

Software Engineer II, Search Science Data Infra at Amazon, focusing on building and managing ML training data pipelines, feature stores, and infrastructure for Amazon Search. This role involves working with big data, distributed systems, and AWS AI services to support ML model training and inference for search ranking, matching, and personalization.

What you'd actually do

  1. Lead development of services and infrastructure at the intersection of machine learning, big data, and distributed systems.
  2. Our products and services empower hundreds of science teams across Amazon to deliver machine learning at scale for ML model training, Feature engineering and Data quality monitoring.
  3. You will be at the center stage for managing machine learning lifecycle and operations using AWS AI services, DL compute resources, and our core search backend services for query understanding, semantic matching, and relevance ranking.
  4. You will drive to provide a world class platform for Amazon Search engineers to comprehensively observe and introspect their applications and services both pre and post deployment to our large scale inference services.
  5. You will build scalable data-intensive infrastructure that processes huge amounts of logs, catalogs, transactional data, and telemetry signals.

Skills

Required

  • 3+ years of non-internship professional software development experience
  • 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
  • 1+ years of software development engineer or related occupational experience
  • 1+ years of designing and developing large-scale, multi-tiered, multi-threaded, embedded or distributed software applications, tools, systems, and services using: C#, C++, Java, or Perl experience
  • 1+ years of Object Oriented Design experience
  • Bachelor's degree or foreign equivalent in Computer Science, Engineering, Mathematics, or a related field
  • Experience programming with at least one software programming language

Nice to have

  • Master's degree in computer science or equivalent
  • Experience with training and deploying machine learning systems to solve large-scale optimizations, or experience operating highly available, distributed systems of data extraction, ingestion, and processing of large data sets
  • Experience in operational excellence, security compliance, and distributed systems resiliency
  • Experience working with customers with a passion for delivering exceptional service, or experience that includes strong analytical skills, attention to detail, and effective communication abilities

What the JD emphasized

  • ML model training data
  • Feature Store infrastructure
  • ML lifecycle and operations
  • large scale inference services

Other signals

  • ML model training data
  • Feature store infrastructure
  • ML lifecycle and operations
  • AWS AI services
  • DL compute resources
  • large scale inference services
  • data-intensive infrastructure