Generative AI - Senior Associate

JPMorgan Chase JPMorgan Chase · Banking · New York, NY +1 · Corporate Sector

This role focuses on designing, building, and supporting production generative AI products and reusable backend APIs within a large enterprise. It involves combining enterprise data with LLMs and multimodal models to deliver scalable, measurable solutions, collaborating with ML, cloud, and SRE teams, and ensuring reliability, performance, and operational controls. The role emphasizes hands-on engineering, real-world constraints, and high-impact delivery.

What you'd actually do

  1. Build and operate production generative artificial intelligence services and reusable backend application programming interfaces for firmwide use
  2. Combine enterprise data assets with large language and multimodal models to deliver high-quality user experiences
  3. Design scalable architectures with clear interfaces and separation of concerns to enable broader developer adoption
  4. Implement batch and real-time processing patterns to support high-throughput, low-latency use cases
  5. Collaborate with cloud engineering and site reliability engineering partners to deliver resilient, observable systems

Skills

Required

  • PhD in a quantitative discipline such as Computer Science, Mathematics, or Statistics, or equivalent practical experience
  • 3+ years of experience as an individual contributor in machine learning engineering or applied machine learning software engineering
  • Demonstrated experience delivering production machine learning services in an enterprise environment, including being accountable for service health
  • Strong fundamentals in statistics, optimization, and machine learning theory with applied depth in natural language processing and/or computer vision
  • Hands-on experience building distributed, multi-threaded, and scalable systems (for example Ray, Horovod, or DeepSpeed)
  • Strong software engineering fundamentals, including data structures, algorithms, and software development lifecycle best practices
  • Experience designing and delivering service-oriented systems and application programming interfaces with scalability and performance requirements
  • Ability to define success metrics and write clear objectives and key results aligned to business expectations
  • Strong problem-framing skills to align machine learning solutions to business objectives and constraints
  • Excellent communication skills with the ability to influence and build trust across technical and non-technical stakeholders

Nice to have

  • Experience designing and implementing pipeline workflows using directed acyclic graph frameworks (for example Kubeflow, DVC, or Ray)
  • Experience building batch and streaming microservices exposed via gRPC and/or GraphQL
  • Demonstrable experience with parameter-efficient fine-tuning, quantization, and quantization-aware fine-tuning for large language models
  • Experience with advanced prompting strategies such as chain-of-thought, tree-of-thought, or graph-of-thought approaches
  • Experience with multimodal large language model use cases (text plus image, speech, or video)
  • Experience partnering closely with cloud engineering and site reliability engineering teams on production readiness and operations
  • Experience measuring and improving model quality using offline evaluation and production monitoring

What the JD emphasized

  • production generative artificial intelligence products
  • reusable backend application programming interfaces
  • enterprise environment
  • service health
  • scalable systems
  • service-oriented systems
  • scalability and performance requirements
  • responsible artificial intelligence practices, controls, and governance

Other signals

  • production generative artificial intelligence products
  • reusable backend application programming interfaces
  • large enterprise datasets with large language and multimodal models
  • scalable, measurable solutions
  • reliability, performance, and strong operational controls
  • technical design decisions, delivery planning, and continuous improvement