AI Solution Architect

Amazon Amazon · Big Tech · IN, MH, Mumbai · Solutions Architect

AI Specialist Solutions Architect for AWS, focusing on helping enterprise customers adopt and scale GenAI, ML, and Agentic technologies. The role involves designing technical architectures, advising on best practices, and acting as a trusted advisor to customers, with a strong emphasis on production deployment and operational efficiency.

What you'd actually do

  1. The AI Specialist SA team builds technical relationships with customers of all sizes and operate as their trusted advisor, ensuring they get the most out of the cloud at every stage of their journey while adopting GenAI/ML and Agentic technologies across their organisation.
  2. You’ll manage the overall technical relationship between AWS and our customers, making recommendations on security, cost, performance, reliability and operational efficiency to accelerate their challenging GenAI/ML and Agentic projects.
  3. Internally, you will be the voice of the customer, sharing their needs with regard to their usage of our services impacting the roadmap of AWS GenAI/ML and Agentic features.
  4. In this role, your creativity will link technology to tangible solutions, with the opportunity to define cloud-native GenAI/ML and Agentic architectural patterns for a variety of use cases.
  5. You will participate in the creation and sharing of best practices, technical content and new reference architectures (e.g. white papers, code samples, blog posts) and evangelize and educate about running GenAI/ML and Agentic workloads on AWS technology (e.g. through workshops, user groups, meetups, public speaking, online videos or conferences).

Skills

Required

  • design/implementation/operations/consulting with distributed applications
  • management of technical, enterprise customer facing resources
  • communicating complex concepts clearly and effectively
  • leading engineering discussions around technology decisions and strategy
  • IT systems or relevant commercial production environment experience
  • developing and deploying LLMs in production on GPUs, Neuron, TPU or other AI acceleration hardware
  • designing or architecting (design patterns, reliability and scaling) of new and existing systems
  • Hands-on experience with AWS ecosystems (including Bedrock, AgentCore, and SageMaker)
  • implementing Retrieval-Augmented Generation using embeddings, vector stores, and semantic search optimization

Nice to have

  • Machine Learning and Large Language Model fundamentals, including architecture, training/inference lifecycles, and optimization of model execution
  • full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
  • engaging and influencing C-level executives, both business and technical
  • Cloud Technology Certification, or AWS Professional level certification
  • developing solutions and executing plans on complex projects
  • leading and influencing your team or organization
  • Master's degree or above in computer science, mathematics, statistics, machine learning or equivalent quantitative field, or PhD
  • PyTorch, JIT compilation, and AOT tracing
  • multi-agent orchestration, tool use, memory, and guardrails using frameworks such as LangGraph, AutoGen, or AWS AgentCore
  • responsible AI tooling including AWS Clarify, Guardrails for Bedrock, model explainability, and bias detection
  • determine solution strategy and where to simplify or extend solutions for the best outcome
  • architecting AI systems within highly regulated or security-sensitive environments (e.g., Financial Services, Healthcare, Public Sector)

What the JD emphasized

  • deploying LLMs in production
  • Retrieval-Augmented Generation using embeddings, vector stores, and semantic search optimization
  • multi-agent orchestration, tool use, memory, and guardrails
  • architecting AI systems within highly regulated or security-sensitive environments

Other signals

  • customer adoption of GenAI/ML and Agentic technologies
  • architectural patterns for GenAI/ML and Agentic workloads
  • deploying LLMs in production