What you'd actually do

lead the design and development of agentic evaluation frameworks and evaluation/critic model training that assess the quality and effectiveness of AI agents at scale.

define evaluation methodologies, create benchmarks, and build evaluation models and automated systems that measure agent performance across critical dimensions.

stay at the forefront of the rapidly evolving field by studying and adopting state-of-the-art methods, conducting original research to advance the science of agent and evaluation.

own the end-to-end lifecycle from research and data curation through model training to production deployment, working closely with engineering to deliver evaluation capabilities as managed AWS services.

collaborate with cross-functional stakeholders to translate science insights into actionable improvements, mentor junior scientists, and contribute to the broader research community.

Skills

Required

building machine learning models for business application experience
PhD, or Master's degree and 6+ years of applied research experience
Experience programming in Java, C++, Python or related language
Experience with neural deep learning methods and machine learning

Nice to have

modeling tools such as R, scikit-learn, Spark MLLib, MxNet, Tensorflow, numpy, scipy etc.
large scale distributed systems such as Hadoop, Spark etc.

Other signals

leading the design and development of agentic evaluation frameworks

training evaluation/critic models

defining evaluation methodologies and creating benchmarks

building evaluation models and automated systems

researching and building innovative solutions using Agentic AI

delivering evaluation capabilities as managed AWS services

Amazon is looking for a passionate, talented, and inventive Applied Scientist with a strong machine learning background to help build industry-leading technology in generative AI and foundational models.

As part of our AI team in Amazon AWS, you will work alongside internationally recognized experts to develop novel algorithms and modeling techniques to advance the state-of-the-art in generative AI. Your work will directly impact millions of our customers in the form of products and services that make use of speech, vision and language technology. You will gain hands on experience with Amazon’s heterogeneous speech, text, image and structured data sources, and large-scale computing resources to accelerate advances in machine learning and foundation models. More specifically, you will have the opportunity to impact millions of our customers by researching and building innovative solutions using Agentic AI.

Agentic AI drives innovation at the forefront of artificial intelligence, enabling customers to transform their businesses through generative AI solutions. We build and deliver the foundational AI services that power the future of cloud computing, helping organizations harness the potential of AI to solve their most complex challenges. Join our dynamic team of AI/ML practitioners and applied scientists who work backwards from customer needs to create novel technologies. If you're passionate about shaping the future of AI while making a meaningful impact for customers worldwide, we want to hear from you.

Key job responsibilities The Senior Applied Scientist will lead the design and development of agentic evaluation frameworks and evaluation/critic model training that assess the quality and effectiveness of AI agents at scale. They will define evaluation methodologies, create benchmarks, and build evaluation models and automated systems that measure agent performance across critical dimensions. The scientist will stay at the forefront of the rapidly evolving field by studying and adopting state-of-the-art methods, conducting original research to advance the science of agent and evaluation. They will own the end-to-end lifecycle from research and data curation through model training to production deployment, working closely with engineering to deliver evaluation capabilities as managed AWS services. They will collaborate with cross-functional stakeholders to translate science insights into actionable improvements, mentor junior scientists, and contribute to the broader research community.

A day in the life A day in the life Diverse Experiences AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.

Why AWS? Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.

Inclusive Team Culture Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon) conferences, inspire us to never stop embracing our uniqueness.

Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.

Basic Qualifications

3+ years of building machine learning models for business application experience
PhD, or Master's degree and 6+ years of applied research experience
Experience programming in Java, C++, Python or related language
Experience with neural deep learning methods and machine learning

Preferred Qualifications

Experience with modeling tools such as R, scikit-learn, Spark MLLib, MxNet, Tensorflow, numpy, scipy etc.
Experience with large scale distributed systems such as Hadoop, Spark etc.

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.

USA, CA, Santa Clara - 192,200.00 - 260,000.00 USD annually

Basic Qualifications

3+ years of building machine learning models for business application experience
PhD, or Master's degree and 6+ years of applied research experience
Experience programming in Java, C++, Python or related language
Experience with neural deep learning methods and machine learning

Preferred Qualifications

Experience with modeling tools such as R, scikit-learn, Spark MLLib, MxNet, Tensorflow, numpy, scipy etc.
Experience with large scale distributed systems such as Hadoop, Spark etc.

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

USA, CA, Santa Clara - 192,200.00 - 260,000.00 USD annually

Senior Applied Scientist, Amazon Aws Agentic Ai, Aws AI Fundamental Research

What you'd actually do

Skills

Required

Nice to have

What the JD emphasized

Other signals

Basic Qualifications

Preferred Qualifications

Basic Qualifications

Preferred Qualifications