Applied Scientist Ii, Selection Monitoring, Selection Monitoring

Amazon Amazon · Big Tech · IN, KA, Bengaluru · Applied Science

This role focuses on developing and deploying advanced ML/AI technologies for Amazon's catalog expansion, involving information extraction, web crawling, and comprehension using deep learning, NLP, and image processing. The scientist will build scalable systems to extract and structure information from the web, including visually rich documents, and work with LLMs/SLMs and agentic systems. The role requires a strong foundation in ML model development, deployment, and collaboration with software engineering teams.

What you'd actually do

  1. Use AI, NLP and advances in LLMs/SLMs and agentic systems to create scalable solutions for business problems.
  2. Efficiently Crawl web, Automate extraction of relevant information from large amounts of Visually Rich Documents and optimize key processes.
  3. Design, develop, evaluate and deploy, innovative and highly scalable ML models.
  4. Work closely with software engineering teams to drive real-time model implementations.
  5. Establish scalable, efficient, automated processes for large scale model development, model validation and model maintenance.

Skills

Required

  • building ML models for business application
  • PhD, or Master's degree and 4+ years of CS, CE, ML or related field experience
  • programming in Python, Java, C++ or related languages
  • algorithms and data structures
  • parsing
  • numerical optimization
  • data mining
  • parallel and distributed computing
  • high-performance computing

Nice to have

  • Unix/Linux
  • professional software development
  • patents or publications at top-tier peer-reviewed conferences or journals

What the JD emphasized

  • advanced ML/AI technologies
  • drive solutions from research, prototype, design, coding and deployment
  • tackle challenging problems
  • depth and breadth of knowledge
  • programming and design skills
  • Scale
  • Accuracy
  • Speed
  • Diversity
  • Build a scalable system
  • Intelligently cluster web pages, segment and classify regions, extract relevant information and structure the data available on semi-structured web.
  • Build systems that will use existing Knowledge Base to perform open information extraction at scale from visually rich documents.
  • scalable solutions
  • large amounts of Visually Rich Documents
  • highly scalable ML models
  • real-time model implementations
  • large scale model development
  • model validation
  • model maintenance
  • Lead projects
  • mentor other scientists, engineers
  • Publish innovation in research forums
  • building ML models for business application
  • publications at top-tier peer-reviewed conferences or journals

Other signals

  • Develop advanced ML/AI technologies
  • Drive solutions from research, prototype, design, coding and deployment
  • Build scalable systems
  • Design, develop, evaluate and deploy, innovative and highly scalable ML models