Applied Scientist, Fintelligence

Amazon Amazon · Big Tech · Bellevue, WA · Applied Science

This role focuses on building and deploying generative AI applications within Amazon's FinTech organization. The primary responsibility is to create AI systems and autonomous agents that process financial transactions, extract intelligence from documents, and learn from user interactions. The role involves solving inference at scale, developing robust evaluation frameworks, and ensuring the AI systems are trustworthy and compliant for finance teams. The work spans from model architecture to deployment and customer workflow impact, with a strong emphasis on shipping production-ready AI solutions.

What you'd actually do

  1. Building AI systems that finance teams trust enough to rely on without manual review, where precision isn't a nice-to-have, it's a compliance requirement.
  2. Designing agents that learn from user corrections and get measurably better with every interaction, not just at the next model release.
  3. Solving inference at massive scale using tiered model architectures, intelligent routing, and small language models that deliver production-grade accuracy at a fraction of frontier model cost.
  4. Developing evaluation frameworks that catch quality regressions before customers do and gate every model change before it ships.

Skills

Required

  • PhD, or Master's degree and 4+ years of CS, CE, ML or related field experience
  • Experience building machine learning models or developing algorithms for business application
  • Experience programming in Java, C++, Python or related language
  • 3+ years of building models for business application experience

Nice to have

  • PhD in computer science, machine learning, engineering, or related fields
  • Experience in building speech recognition, machine translation and natural language processing systems (e.g., commercial speech products or government speech projects)
  • Experience in patents or publications at top-tier peer-reviewed conferences or journals

What the JD emphasized

  • compliance requirement
  • measurably better
  • massive scale
  • production-grade accuracy
  • evaluation frameworks
  • ships to production
  • genuinely hard
  • financial data is messy, regulated, high-stakes
  • naive LLM approaches break down

Other signals

  • building AI systems that finance teams trust enough to rely on without manual review
  • designing agents that learn from user corrections and get measurably better with every interaction
  • solving inference at massive scale using tiered model architectures, intelligent routing, and small language models
  • developing evaluation frameworks that catch quality regressions before customers do and gate every model change before it ships