Generative AI - Lead

JPMorgan Chase JPMorgan Chase · Banking · New York, NY +1 · Corporate Sector

Lead Generative AI Engineering role focused on designing, delivering, and improving production generative AI products and reusable backend APIs for firmwide adoption. This hands-on leadership position involves architecting scalable systems, setting technical direction for model-enabled services, ensuring reliability and responsible AI controls, and mentoring engineers. The role operates in production environments, emphasizing building at scale and driving execution against measurable outcomes.

What you'd actually do

  1. Lead the design and delivery of production generative artificial intelligence products and reusable backend application programming interfaces for firmwide adoption
  2. Architect scalable systems that combine large enterprise datasets with large language and multimodal models
  3. Set technical direction for model-enabled services, including quality, latency, throughput, and cost targets
  4. Partner with cloud engineering and site reliability engineering teams to deliver resilient architectures, observability, and operational readiness
  5. Drive translation of research concepts into production-ready capabilities through evaluation, iteration, and hardening

Skills

Required

  • PhD in a quantitative discipline such as Computer Science, Mathematics, or Statistics, or equivalent practical experience
  • 7+ years of experience in machine learning engineering and/or applied software engineering delivering production systems
  • 3+ years of technical leadership experience, including leading delivery for complex cross-functional initiatives
  • Demonstrated experience owning enterprise machine learning services, including reliability, incident management, and service-level outcomes
  • Strong fundamentals in statistics, optimization, and machine learning theory with applied expertise in natural language processing and/or computer vision
  • Hands-on experience implementing distributed, multi-threaded, scalable systems (for example Ray, Horovod, or DeepSpeed)
  • Proven ability to design and scale service-oriented architectures and application programming interfaces with high availability and performance requirements
  • Experience defining success metrics and writing clear objectives and key results aligned to business expectations
  • Strong judgment to align technical decisions with governance, risk, and control requirements for responsible artificial intelligence
  • Excellent communication and stakeholder management skills, with ability to influence across senior technical and business audiences

Nice to have

  • Experience designing and implementing machine learning pipelines using directed acyclic graph frameworks (for example Kubeflow, DVC, or Ray)
  • Experience building batch and streaming microservices exposed via gRPC and/or GraphQL
  • Demonstrable experience with parameter-efficient fine-tuning, quantization, and quantization-aware fine-tuning for large language models
  • Experience with multimodal large language model use cases (text plus image, speech, or video)
  • Experience with advanced prompting and reasoning approaches such as chain-of-thought, tree-of-thought, or graph-of-thought
  • Experience establishing evaluation frameworks and production monitoring for model quality, safety, and drift
  • Experience building reusable platforms that enable other teams to ship model-enabled products faster

What the JD emphasized

  • lead the delivery of enterprise-grade generative artificial intelligence products and platforms with strong governance and controls
  • building at scale and operating in real production environments
  • lead the design, delivery, and continuous improvement of production generative artificial intelligence products and reusable backend application programming interfaces
  • responsible artificial intelligence controls
  • strong judgment to align technical decisions with governance, risk, and control requirements for responsible artificial intelligence

Other signals

  • leading delivery of enterprise-grade generative AI products
  • building at scale and operating in real production environments
  • reusable backend application programming interfaces used across the firm
  • responsible artificial intelligence controls
  • architect scalable systems that combine large enterprise datasets with large language and multimodal models