Principal Associate, Data Scientist - LLM Customization Team

Capital One Capital One · Banking · New York, NY +1

Capital One is seeking a Principal Associate Data Scientist to join their LLM Customization Team. This role involves partnering with cross-functional teams to deliver AI-powered products, leveraging technologies like Pytorch, Hugging Face, LangChain, and VectorDBs. The primary focus is on adapting and fine-tuning LLMs for business-specific applications, building NLP models through all phases of development, and operationalizing them in production systems.

What you'd actually do

  1. Partner with a cross-functional team of data scientists, applied researchers, software engineers, machine learning engineers and product managers to deliver AI powered products that change how customers interact with their money.
  2. Leverage a broad stack of technologies — Pytorch, Hugging Face, AWS Ultraclusters, LangChain, VectorDBs, and more — to reveal the insights hidden within huge volumes of numeric and textual data.
  3. Be the expert in Natural Language Processing (NLP) to harness the power of Large Language Models (LLMs), adapt and finetune them for business specific applications and features.
  4. Build NLP models through all phases of development, from design through training, evaluation, and validation; partnering with engineering teams to operationalize them in scalable and resilient production systems.

Skills

Required

  • Python
  • Scala
  • R
  • machine learning
  • Large Language Models (LLM)
  • Finetuning
  • Deep Learning
  • NLP

Nice to have

  • AWS
  • Pytorch
  • Hugging Face
  • LangChain
  • VectorDBs
  • training optimization
  • self-supervised learning
  • explainability
  • RLHF

What the JD emphasized

  • deliver AI powered products
  • adapt and finetune them
  • operationalize them in scalable and resilient production systems
  • delivering models at scale both in training data and inference volumes
  • experience in delivering libraries, platforms, or solution level code to existing products

Other signals

  • LLM Customization
  • building production systems
  • applying state of the art AI