AI Engineer - Lead Software Engineer

JPMorgan Chase · Banking · Jersey City, NJ +1 · Corporate Sector

Lead Software Engineer to architect and implement scalable large language model systems and agentic AI platforms for enterprise use cases. Responsibilities include designing cloud-native solutions, establishing evaluation and observability standards, optimizing platform performance, and mentoring engineers. The role involves building production-grade AI systems with agents, skills, memory, guardrails, and tool-use orchestration, as well as architecting retrieval and context-engineering approaches.

What you'd actually do

Lead the architecture and hands-on delivery of scalable, reliable agentic AI platforms for enterprise workflows
Design and build production-grade AI systems including agents, skills, memory patterns, guardrails, and tool-use orchestration
Architect retrieval and context-engineering approaches including embeddings, semantic search, grounding, summarization, and prompt/version management
Engineer cloud-native AI services on AWS using containers and serverless patterns, event-driven messaging, and distributed data stores
Optimize platform performance across latency, throughput, scalability, caching, context efficiency, and cost controls

Skills

Required

Formal training or certification on software engineering concepts
5+ years applied experience
Experience architecting and shipping production large language model applications, including agentic workflows and tool integration patterns
Strong software engineering fundamentals with ability to deliver cloud-native services using containers and serverless designs on AWS
Proficiency designing distributed systems with asynchronous workflows, durable messaging, and scalable data access patterns
Experience building retrieval-augmented generation solutions (embeddings, semantic search, grounding) and managing prompt lifecycle/versioning
Demonstrated ability to implement evaluation and monitoring approaches for model quality, reliability, and safe behavior over time
Strong API design skills, including secure integration patterns and reusable platform capability development
Proven technical leadership skills, including mentoring, driving architecture decisions, and influencing cross-functional stakeholders
Hands-on experience using enterprise-authorized AI-assisted software development tools within the work environment (e.g., for coding, test creation, troubleshooting, or documentation) with demonstrated ability to critically evaluate, validate, and refine AI-generated outputs for correctness, performance, and security.
Understanding of responsible AI use in engineering workflows, including data sensitivity considerations, secure handling of inputs/outputs, and adherence to resiliency and security expectations; ability to guide peers on safe and effective usage within team practices.

Nice to have

Experience building standardized evaluation harnesses, automated regression suites, and experimentation platforms for large language model systems
Hands-on experience with Kubernetes-based deployment patterns and operational excellence practices for high-availability services
Experience applying privacy, data minimization, and safe AI guardrail patterns in regulated or high-risk environments
Familiarity with context-efficiency optimization techniques and cost governance for large language model workloads
Experience building reusable developer platforms, reference architectures, and technical standards across multiple teams

What the JD emphasized

5+ years applied experience
architecting and shipping production large language model applications, including agentic workflows and tool integration patterns
Strong software engineering fundamentals
cloud-native services using containers and serverless designs on AWS
distributed systems with asynchronous workflows, durable messaging, and scalable data access patterns
retrieval-augmented generation solutions (embeddings, semantic search, grounding) and managing prompt lifecycle/versioning
evaluation and monitoring approaches for model quality, reliability, and safe behavior over time
API design skills, including secure integration patterns and reusable platform capability development
Proven technical leadership skills, including mentoring, driving architecture decisions, and influencing cross-functional stakeholders
Hands-on experience using enterprise-authorized AI-assisted software development tools within the work environment
Understanding of responsible AI use in engineering workflows, including data sensitivity considerations, secure handling of inputs/outputs, and adherence to resiliency and security expectations
Experience applying privacy, data minimization, and safe AI guardrail patterns in regulated or high-risk environments

Other signals

building production AI platforms
agentic AI platforms
LLM Suite
enterprise use cases
cloud-native solutions
evaluation and observability standards
developer velocity
production-grade AI systems
agents, skills, memory patterns, guardrails, and tool-use orchestration
retrieval and context-engineering approaches
embeddings, semantic search, grounding, summarization, and prompt/version management
cloud-native AI services on AWS
containers and serverless patterns
event-driven messaging
distributed data stores
platform performance optimization
latency, throughput, scalability, caching, context efficiency, and cost controls
well-governed APIs and integrations
evaluation, experimentation, regression testing, and observability frameworks
reliability, security, and safe AI operation
senior engineers
architecture forums
cross-team technical leadership
enterprise-authorized AI coding assist tools
code generation/refactoring
unit test creation
documentation
peer review
automated testing
secure coding standards
Software Development Life Cycle toolchain
AI-assisted development and automation capabilities
architecting and shipping production large language model applications
agentic workflows and tool integration patterns
cloud-native services using containers and serverless designs on AWS
distributed systems with asynchronous workflows, durable messaging, and scalable data access patterns
retrieval-augmented generation solutions
embeddings, semantic search, grounding
prompt lifecycle/versioning
evaluation and monitoring approaches for model quality, reliability, and safe behavior
API design skills
secure integration patterns
reusable platform capability development
technical leadership skills
mentoring
driving architecture decisions
influencing cross-functional stakeholders
enterprise-authorized AI-assisted software development tools
critically evaluate, validate, and refine AI-generated outputs
responsible AI use in engineering workflows
data sensitivity considerations
secure handling of inputs/outputs
resiliency and security expectations
guide peers on safe and effective usage
standardized evaluation harnesses
automated regression suites
experimentation platforms for large language model systems
Kubernetes-based deployment patterns
operational excellence practices for high-availability services
privacy, data minimization, and safe AI guardrail patterns
regulated or high-risk environments
context-efficiency optimization techniques
cost governance for large language model workloads
reusable developer platforms
reference architectures
technical standards across multiple teams
Section 19 of the Federal Deposit Insurance Act
criminal conviction history
pretrial diversions or program entries

Read full job description

As an Lead Software Engineer at JPMorganChase within Enterprise Technology, you will lead the architecture and hands-on implementation of scalable large language model systems and agentic AI platforms for enterprise use cases leveraging LLM Suite. You will design cloud-native solutions, establish evaluation and observability standards, and drive technical decisions across teams to improve reliability, cost, and developer velocity. LLM Suite is JPMorganChase’s premier internally built AI tool leveraged by +250k employees for everything from individual productivity to larger scale, business solutions.

Build and scale production AI platforms that turn large language model capabilities into reliable, secure, and measurable business outcomes. You will partner across product and engineering teams to design architectures, ship reusable capabilities, and raise quality through strong engineering practices and technical leadership.

Job responsibilities

Lead the architecture and hands-on delivery of scalable, reliable agentic AI platforms for enterprise workflows
Design and build production-grade AI systems including agents, skills, memory patterns, guardrails, and tool-use orchestration
Architect retrieval and context-engineering approaches including embeddings, semantic search, grounding, summarization, and prompt/version management
Engineer cloud-native AI services on AWS using containers and serverless patterns, event-driven messaging, and distributed data stores
Optimize platform performance across latency, throughput, scalability, caching, context efficiency, and cost controls
Build well-governed APIs and integrations that connect AI capabilities to enterprise platforms, tools, and business processes
Establish evaluation, experimentation, regression testing, and observability frameworks to continuously improve quality and agent behavior
Define engineering standards for reliability, security, and safe AI operation across the platform lifecycle
Mentor senior engineers and influence engineering direction through code reviews, architecture forums, and cross-team technical leadership
Leverages enterprise-authorized AI coding assist tools within the work environment to improve code quality, delivery speed, and productivity across complex deliverables (e.g., code generation/refactoring, unit test creation, documentation), while validating outputs through peer review, automated testing, and secure coding standards; contributes learnings and reusable patterns to improve broader team effectiveness.
Applies knowledge of tools within the Software Development Life Cycle toolchain, including enterprise-authorized AI-assisted development and automation capabilities, to improve the value realized by automation.

Required qualifications, capabilities and skills

Formal training or certification on software engineering concepts and 5+ years applied experience
Experience architecting and shipping production large language model applications, including agentic workflows and tool integration patterns
Strong software engineering fundamentals with ability to deliver cloud-native services using containers and serverless designs on AWS
Proficiency designing distributed systems with asynchronous workflows, durable messaging, and scalable data access patterns
Experience building retrieval-augmented generation solutions (embeddings, semantic search, grounding) and managing prompt lifecycle/versioning
Demonstrated ability to implement evaluation and monitoring approaches for model quality, reliability, and safe behavior over time
Strong API design skills, including secure integration patterns and reusable platform capability development
Proven technical leadership skills, including mentoring, driving architecture decisions, and influencing cross-functional stakeholders
Hands-on experience using enterprise-authorized AI-assisted software development tools within the work environment (e.g., for coding, test creation, troubleshooting, or documentation) with demonstrated ability to critically evaluate, validate, and refine AI-generated outputs for correctness, performance, and security.
Understanding of responsible AI use in engineering workflows, including data sensitivity considerations, secure handling of inputs/outputs, and adherence to resiliency and security expectations; ability to guide peers on safe and effective usage within team practices.

Preferred qualifications, capabilities and skills

Experience building standardized evaluation harnesses, automated regression suites, and experimentation platforms for large language model systems
Hands-on experience with Kubernetes-based deployment patterns and operational excellence practices for high-availability services
Experience applying privacy, data minimization, and safe AI guardrail patterns in regulated or high-risk environments
Familiarity with context-efficiency optimization techniques and cost governance for large language model workloads
Experience building reusable developer platforms, reference architectures, and technical standards across multiple teams

FEDERAL DEPOSIT INSURANCE ACT: This position is subject to Section 19 of the Federal Deposit Insurance Act. As such, an employment offer for this position is contingent on JPMorganChase’s review of criminal conviction history, including pretrial diversions or program entries.