Applied AI ML Lead [multiple Positions Available]

JPMorgan Chase JPMorgan Chase · Banking · Columbus, OH +1 · Consumer & Community Banking

Lead the development and deployment of AI/ML solutions for virtual assistants and transaction search applications within a financial services context. Focus on fine-tuning LLMs and SLMs, optimizing inference, and managing the release cycle for production-grade AI.

What you'd actually do

  1. Engage in cutting-edge initiatives to enhance virtual assistant capabilities.
  2. Drive the development of scalable, production-grade AI solutions through experimentation with large language models (LLMs), small language models (SLMs) and domain-specific fine-tuning.
  3. Own the machine learning solution for the transaction search application, which includes stakeholder management, continuous optimization, and research and innovation.
  4. Lead the build of a question-and-answer solution using advanced NLP techniques, allowing Chase customers to ask questions on the chase.com website.
  5. Lead Al Solution Development, stakeholder management, model release management, research on NLP Solutions to drive project success and deliver production-ready solutions.

Skills

Required

  • Python
  • Supervised and Unsupervised Learning
  • feature engineering
  • Hyperparameter Optimization
  • Neural Networks (CNN, RNN, LSTM, Transformers)
  • Open Source Embedding Models
  • Tokenization
  • Named Entity Recognition
  • Semantic Search
  • Topic Modeling
  • Prompt Engineering
  • System Prompt Design
  • Retrieval-Augmented Generation (RAG)
  • Instruction Fine-Tuning
  • Parameter-Efficient Fine-Tuning
  • Multi-adapter Architectures
  • Domain Adaptation model training for Banking and Financial NLP
  • Synthetic Data Generation for Fine-Tuning
  • Dense Retrieval
  • Sparse Retrieval
  • Hybrid Search
  • Embedding-based Semantic Search
  • Precision, Recall, F1
  • Mean Reciprocal Rank (MRR)
  • Normalized Discounted Cumulative Gain (NDCG)
  • SQUAD Metrics
  • Exact Match
  • Perplexity
  • Multi-class and Multi-label Evaluation
  • Latency Profiling
  • Human-in-the-loop Evaluation
  • Distributed Training (Data Parallel, Fully Sharded Data Parallel, Mixed Precision Training, Multi-GPU Scaling)
  • LoRA (Low Rank Adapters)
  • Fine-Tuning of SLMs
  • KV Caching
  • Semantic Caching
  • Distributed Inference
  • Quantization
  • Low-latency API Design
  • Scaling LLM Serving on GPU and CPU
  • Snowflake
  • Databricks
  • Sagemaker

What the JD emphasized

  • production-grade AI solutions
  • domain-specific fine-tuning
  • model fine-tuning techniques
  • production deployment
  • machine learning applications
  • model optimization
  • stakeholder management
  • continuous optimization
  • research and innovation
  • advanced NLP techniques
  • state-of-the-art research papers
  • deep learning models
  • generative Al methods
  • large (LLM) and small language models (SLM)
  • adapted model for the finance domain
  • optimize training efficiency
  • Al Solution Development
  • model release management
  • NLP Solutions
  • production-ready solutions

Other signals

  • Develop and deploy production-grade AI solutions
  • Lead AI solution development and model release management
  • Optimize models using advanced fine-tuning techniques