Associate Ii, ML Data Operations, Go-ai Operations

Amazon Amazon · Big Tech · IN, TS, Hyderabad · Remote · Machine Learning Science

This role is for an Associate II, ML Data Operations at Amazon Robotics, focusing on data annotation for training and validating machine learning models. It's a non-technical role that requires executing data annotation tasks across various modalities (text, image, video, audio) to support LLM capabilities and robotics development. The role involves analyzing processes, ensuring data quality, and contributing to operational improvements.

What you'd actually do

  1. Perform precise and consistent annotations across multiple data types (image, video, and text). This includes mastering techniques like object detection, semantic segmentation (pixel-level labeling), object tracking in video, and open-text evaluation.
  2. Proactively identify and correct errors, ensuring high data integrity and making obsessive precision crucial for model training.
  3. Write grammatically correct, creative, and technical texts in various styles, strictly adhering to complex project guidelines
  4. Make strong judgment to address ambiguous situations or incomplete information and, when guidelines fail, propose logical, consistent solutions that contribute to process improvement.
  5. Quickly learn and efficiently utilize various specialized annotation tools and platforms, adapting to new methodologies as required by evolving programs in domains like packaging, manipulation, storage, and sortation automation.

Skills

Required

  • Bachelor’s degree in any discipline with 0.6 - 5 years of experience working with data transcription and annotation.
  • English proficiency (C1+ or > C1 fluency) with good business writing skills
  • Demonstrate proficiency in generating high quality human insight data across a range of modalities, inclusive of text, image video and audio.
  • Specializing in LLM annotation work using amazon internal generative AI tools.
  • Proficient research skills with experience to write basic prompts to gather and synthesizing information from multiple sources
  • Excellent organizational and time management skills to prioritize complex tasks effectively.
  • Comfortable working in a collaborative environment with remote, multi-cultural teams, willing to share knowledge, and able to maintain individual productivity goals.
  • Good communication skills with the ability to articulate complex ideas and provide clear explanations.
  • Work in a flexible schedule/shift/work area, including weekends, nights, and/or holidays

Nice to have

  • opportunity to transition into a full-time position based on performance and business requirements

What the JD emphasized

  • non-technical role
  • human-in-the-loop expert
  • data annotation tasks
  • training and validation of machine learning models
  • quality and integrity of data
  • frontier AI improvements
  • precise text/video/image annotation
  • non-technical operational role
  • deliver high-quality training data
  • process oriented, not programming skills
  • perform precise annotation tasks
  • mastering techniques like object detection, semantic segmentation (pixel-level labeling), object tracking in video, and open-text evaluation
  • obsessive precision crucial for model training
  • strictly adhering to complex project guidelines
  • strong judgment to address ambiguous situations or incomplete information
  • quickly learn and efficiently utilize various specialized annotation tools and platforms
  • foundational labelling functions
  • dialogue evaluation on speech, text, audio, and video data
  • strong concentration
  • effective multitasking
  • familiarity to write basic prompts
  • dive deep into use case and interpret and implement solutions

Other signals

  • data annotation
  • human-in-the-loop
  • LLM training data