AI Engineer - Lead Software Engineer

JPMorgan Chase JPMorgan Chase · Banking · Jersey City, NJ +1 · Corporate Sector

Lead Software Engineer to architect and implement scalable large language model systems and agentic AI platforms for enterprise use cases. Responsibilities include designing cloud-native solutions, establishing evaluation and observability standards, optimizing platform performance, and mentoring engineers. The role involves building production-grade AI systems with agents, skills, memory, guardrails, and tool-use orchestration, as well as architecting retrieval and context-engineering approaches.

What you'd actually do

  1. Lead the architecture and hands-on delivery of scalable, reliable agentic AI platforms for enterprise workflows
  2. Design and build production-grade AI systems including agents, skills, memory patterns, guardrails, and tool-use orchestration
  3. Architect retrieval and context-engineering approaches including embeddings, semantic search, grounding, summarization, and prompt/version management
  4. Engineer cloud-native AI services on AWS using containers and serverless patterns, event-driven messaging, and distributed data stores
  5. Optimize platform performance across latency, throughput, scalability, caching, context efficiency, and cost controls

Skills

Required

  • Formal training or certification on software engineering concepts
  • 5+ years applied experience
  • Experience architecting and shipping production large language model applications, including agentic workflows and tool integration patterns
  • Strong software engineering fundamentals with ability to deliver cloud-native services using containers and serverless designs on AWS
  • Proficiency designing distributed systems with asynchronous workflows, durable messaging, and scalable data access patterns
  • Experience building retrieval-augmented generation solutions (embeddings, semantic search, grounding) and managing prompt lifecycle/versioning
  • Demonstrated ability to implement evaluation and monitoring approaches for model quality, reliability, and safe behavior over time
  • Strong API design skills, including secure integration patterns and reusable platform capability development
  • Proven technical leadership skills, including mentoring, driving architecture decisions, and influencing cross-functional stakeholders
  • Hands-on experience using enterprise-authorized AI-assisted software development tools within the work environment (e.g., for coding, test creation, troubleshooting, or documentation) with demonstrated ability to critically evaluate, validate, and refine AI-generated outputs for correctness, performance, and security.
  • Understanding of responsible AI use in engineering workflows, including data sensitivity considerations, secure handling of inputs/outputs, and adherence to resiliency and security expectations; ability to guide peers on safe and effective usage within team practices.

Nice to have

  • Experience building standardized evaluation harnesses, automated regression suites, and experimentation platforms for large language model systems
  • Hands-on experience with Kubernetes-based deployment patterns and operational excellence practices for high-availability services
  • Experience applying privacy, data minimization, and safe AI guardrail patterns in regulated or high-risk environments
  • Familiarity with context-efficiency optimization techniques and cost governance for large language model workloads
  • Experience building reusable developer platforms, reference architectures, and technical standards across multiple teams

What the JD emphasized

  • 5+ years applied experience
  • architecting and shipping production large language model applications, including agentic workflows and tool integration patterns
  • Strong software engineering fundamentals
  • cloud-native services using containers and serverless designs on AWS
  • distributed systems with asynchronous workflows, durable messaging, and scalable data access patterns
  • retrieval-augmented generation solutions (embeddings, semantic search, grounding) and managing prompt lifecycle/versioning
  • evaluation and monitoring approaches for model quality, reliability, and safe behavior over time
  • API design skills, including secure integration patterns and reusable platform capability development
  • Proven technical leadership skills, including mentoring, driving architecture decisions, and influencing cross-functional stakeholders
  • Hands-on experience using enterprise-authorized AI-assisted software development tools within the work environment
  • Understanding of responsible AI use in engineering workflows, including data sensitivity considerations, secure handling of inputs/outputs, and adherence to resiliency and security expectations
  • Experience applying privacy, data minimization, and safe AI guardrail patterns in regulated or high-risk environments

Other signals

  • building production AI platforms
  • agentic AI platforms
  • LLM Suite
  • enterprise use cases
  • cloud-native solutions
  • evaluation and observability standards
  • developer velocity
  • production-grade AI systems
  • agents, skills, memory patterns, guardrails, and tool-use orchestration
  • retrieval and context-engineering approaches
  • embeddings, semantic search, grounding, summarization, and prompt/version management
  • cloud-native AI services on AWS
  • containers and serverless patterns
  • event-driven messaging
  • distributed data stores
  • platform performance optimization
  • latency, throughput, scalability, caching, context efficiency, and cost controls
  • well-governed APIs and integrations
  • evaluation, experimentation, regression testing, and observability frameworks
  • reliability, security, and safe AI operation
  • senior engineers
  • architecture forums
  • cross-team technical leadership
  • enterprise-authorized AI coding assist tools
  • code generation/refactoring
  • unit test creation
  • documentation
  • peer review
  • automated testing
  • secure coding standards
  • Software Development Life Cycle toolchain
  • AI-assisted development and automation capabilities
  • architecting and shipping production large language model applications
  • agentic workflows and tool integration patterns
  • cloud-native services using containers and serverless designs on AWS
  • distributed systems with asynchronous workflows, durable messaging, and scalable data access patterns
  • retrieval-augmented generation solutions
  • embeddings, semantic search, grounding
  • prompt lifecycle/versioning
  • evaluation and monitoring approaches for model quality, reliability, and safe behavior
  • API design skills
  • secure integration patterns
  • reusable platform capability development
  • technical leadership skills
  • mentoring
  • driving architecture decisions
  • influencing cross-functional stakeholders
  • enterprise-authorized AI-assisted software development tools
  • critically evaluate, validate, and refine AI-generated outputs
  • responsible AI use in engineering workflows
  • data sensitivity considerations
  • secure handling of inputs/outputs
  • resiliency and security expectations
  • guide peers on safe and effective usage
  • standardized evaluation harnesses
  • automated regression suites
  • experimentation platforms for large language model systems
  • Kubernetes-based deployment patterns
  • operational excellence practices for high-availability services
  • privacy, data minimization, and safe AI guardrail patterns
  • regulated or high-risk environments
  • context-efficiency optimization techniques
  • cost governance for large language model workloads
  • reusable developer platforms
  • reference architectures
  • technical standards across multiple teams
  • Section 19 of the Federal Deposit Insurance Act
  • criminal conviction history
  • pretrial diversions or program entries