Applied AI ML Lead - LLM Suite Engineering

JPMorgan Chase JPMorgan Chase · Banking · Wilmington, DE +1 · Corporate Sector

Lead the architecture and hands-on implementation of scalable large language model systems and agentic AI platforms for enterprise use cases, focusing on building production-grade AI systems, optimizing performance, and establishing evaluation and observability standards.

What you'd actually do

  1. Lead the architecture and hands-on delivery of scalable, reliable agentic AI platforms for enterprise workflows
  2. Design and build production-grade AI systems including agents, skills, memory patterns, guardrails, and tool-use orchestration
  3. Architect retrieval and context-engineering approaches including embeddings, semantic search, grounding, summarization, and prompt/version management
  4. Engineer cloud-native AI services on AWS using containers and serverless patterns, event-driven messaging, and distributed data stores
  5. Optimize platform performance across latency, throughput, scalability, caching, context efficiency, and cost controls

Skills

Required

  • applied AI and machine learning concepts
  • architecting and shipping production large language model applications
  • agentic workflows
  • tool integration patterns
  • cloud-native services
  • containers
  • serverless designs on AWS
  • distributed systems
  • asynchronous workflows
  • durable messaging
  • scalable data access patterns
  • retrieval-augmented generation solutions
  • embeddings
  • semantic search
  • grounding
  • prompt lifecycle/versioning
  • evaluation and monitoring approaches
  • model quality
  • reliability
  • safe behavior
  • API design skills
  • secure integration patterns
  • reusable platform capability development
  • technical leadership
  • mentoring
  • driving architecture decisions
  • influencing cross-functional stakeholders

Nice to have

  • standardized evaluation harnesses
  • automated regression suites
  • experimentation platforms for large language model systems
  • Kubernetes-based deployment patterns
  • operational excellence practices for high-availability services
  • privacy
  • data minimization
  • safe AI guardrail patterns
  • regulated or high-risk environments
  • context-efficiency optimization techniques
  • cost governance for large language model workloads
  • reusable developer platforms
  • reference architectures
  • technical standards across multiple teams

What the JD emphasized

  • 8+ years applied experience
  • architecting and shipping production large language model applications, including agentic workflows and tool integration patterns
  • building retrieval-augmented generation solutions (embeddings, semantic search, grounding) and managing prompt lifecycle/versioning
  • implement evaluation and monitoring approaches for model quality, reliability, and safe behavior over time
  • Proven technical leadership skills, including mentoring, driving architecture decisions, and influencing cross-functional stakeholders

Other signals

  • Build and scale production AI platforms
  • lead the architecture and hands-on implementation of scalable large language model systems and agentic AI platforms
  • design cloud-native solutions, establish evaluation and observability standards, and drive technical decisions across teams