Senior AI Solution Architect

Amazon Amazon · Big Tech · Seoul, South Korea · Solutions Architect

This role is for a Senior AI Specialist Solutions Architect at AWS, focusing on helping customers adopt and scale GenAI/ML and Agentic technologies. The architect will build technical relationships, provide architectural guidance on security, cost, performance, and operational efficiency, and act as a voice of the customer internally. Responsibilities include crafting scalable architectures, creating technical content, and leading deep dives and workshops. The role requires experience in designing and implementing production AI systems, including LLMs, multi-modal FMs, fine-tuning, inference, RAG, FM evaluation, Vector DBs, Agentic workflows, and MLOps, with hands-on experience in AWS ecosystems like Bedrock, AgentCore, and SageMaker. Experience in regulated environments is preferred.

What you'd actually do

  1. The AI Specialist SA team builds technical relationships with customers of all sizes and operate as their trusted advisor, ensuring they get the most out of the cloud at every stage of their journey while adopting GenAI/ML and Agentic technologies across their organisation.
  2. You’ll manage the overall technical relationship between AWS and our customers, making recommendations on security, cost, performance, reliability and operational efficiency to accelerate their challenging GenAI/ML and Agentic projects.
  3. In this role, your creativity will link technology to tangible solutions, with the opportunity to define cloud-native GenAI/ML and Agentic architectural patterns for a variety of use cases.
  4. You will participate in the creation and sharing of best practices, technical content and new reference architectures (e.g. white papers, code samples, blog posts) and evangelize and educate about running GenAI/ML and Agentic workloads on AWS technology (e.g. through workshops, user groups, meetups, public speaking, online videos or conferences).
  5. Lead hands-on deep dives and technical workshops, contributing reusable code, reference architectures, and internal technical assets for the broader engineering organization.

Skills

Required

  • 5+ years of design/implementation of production AI systems
  • Experience implementing AI solutions that can include integration of LLMs/multi-modal FMs in large scale systems, fine-tuning LLMs, deployment and distributed inference of LLMs, RAG, FM evaluation, Vector DBs, Agentic workflows, prompt/context engineering, and MLOps
  • Hands-on experience with AWS ecosystems (including Bedrock, AgentCore, and SageMaker) to set up secure, private-network AI environments
  • Practical experience implementing Retrieval-Augmented Generation using embeddings, vector stores, and semantic search optimization
  • Able to effectively communicate across an increasing diversity of audiences internally and externally
  • Ability to influence customer and internal business decision makers as a technical thought leader

Nice to have

  • Proven ability to lead projects with complex challenges with extensible, operationally excellent, cost optimized, and aligned solutions outcomes
  • Ability to lead a team or small organization-wide initiative with business objectives that are partially defined
  • Strong ability to determine solution strategy and where to simplify or extend solutions for the best outcome
  • Master's degree in computer science, mathematics, statistics, machine learning or equivalent quantitative field, or PhD
  • Experience in running & fine-tuning Large and Small Language Models using advanced techniques like LoRA/QLoRA, Instruction Tuning, and RLHF to optimize for specific domain tasks
  • Expertise in architecting AI systems within highly regulated or security-sensitive environments (e.g., Financial Services, Healthcare, Public Sector)

What the JD emphasized

  • production AI systems
  • Agentic workflows
  • Agentic technologies
  • Agentic projects
  • Agentic features
  • Agentic architectural patterns
  • Agentic workloads

Other signals

  • customer-facing
  • architectural guidance
  • production deployments
  • GenAI/ML and Agentic technologies