Software Engineer II, Search Science Data Infra

Amazon · Big Tech · Palo Alto, CA · Software Development

The Software Engineer II, Search Science Data Infra role at Amazon focuses on building and managing infrastructure for ML model training data and feature stores. This involves developing services at the intersection of machine learning, big data, and distributed systems, managing the ML lifecycle, and processing large volumes of data for ML services. The role supports hundreds of science teams across Amazon and works with the core search backend services for query understanding, semantic matching, and relevance ranking.

What you'd actually do

  1. Lead development of services and infrastructure at the intersection of machine learning, big data, and distributed systems.
  2. Build products and services that enable hundreds of science teams across Amazon to deliver machine learning at scale, covering ML model training, feature engineering, and data quality monitoring.
  3. Manage the machine learning lifecycle and operations using AWS AI services, DL compute resources, and the core search backend services for query understanding, semantic matching, and relevance ranking.
  4. Provide a world-class platform that lets Amazon Search engineers comprehensively observe and introspect their applications and services both before and after deployment to large-scale inference services.
  5. Build scalable, data-intensive infrastructure that processes large volumes of logs, catalogs, transactional data, and telemetry signals.

Skills

Required

  • 3+ years of non-internship professional software development experience
  • 2+ years of non-internship experience in the design or architecture (design patterns, reliability, and scaling) of new and existing systems
  • 1+ years of software development engineer or related occupational experience
  • 1+ years of experience designing and developing large-scale, multi-tiered, multi-threaded, embedded, or distributed software applications, tools, systems, and services using C#, C++, Java, or Perl
  • 1+ years of object-oriented design experience
  • Experience programming with at least one software programming language
  • Bachelor's degree in Computer Science, Engineering, Mathematics, or a related field

Nice to have

  • Master's degree or above in computer science or equivalent
  • Experience in operational excellence, security compliance, and distributed systems resiliency
  • Experience working with customers with a passion for delivering exceptional service, or experience that includes strong analytical skills, attention to detail, and effective communication abilities
  • Experience with training and deploying machine learning systems to solve large-scale optimizations, or experience operating highly available, distributed systems of data extraction, ingestion, and processing of large data sets

What the JD emphasized

  • ML model training data
  • Feature Store infrastructure
  • ML lifecycle and operations
  • large-scale data-intensive infrastructure
