Manager, Data Science - LLM Customization Team

Capital One · Banking · McLean, VA +2

Capital One is seeking a Manager, Data Science to lead the LLM Customization team, which focuses on bringing LLMs and GenAI to life within the company. The role involves partnering with cross-functional teams to deliver AI-powered products, using technologies such as PyTorch, Hugging Face, and LangChain, and specializing in NLP for LLM adaptation and fine-tuning. Responsibilities include building NLP models through development, training, evaluation, and validation, and operationalizing them in production systems. The ideal candidate is innovative, creative, technical, and influential, with experience in training language models or large computer vision models and in delivering models at scale.

What you'd actually do

  1. Partner with a cross-functional team of data scientists, applied researchers, software engineers, machine learning engineers, and product managers to deliver AI-powered products that change how customers interact with their money.
  2. Leverage a broad stack of technologies (PyTorch, Hugging Face, AWS Ultraclusters, LangChain, VectorDBs, and more) to reveal the insights hidden within huge volumes of numeric and textual data.
  3. Be the expert in Natural Language Processing (NLP) who harnesses the power of Large Language Models (LLMs), adapting and fine-tuning them for business-specific applications and features.
  4. Build NLP models through all phases of development, from design through training, evaluation, and validation, and partner with engineering teams to operationalize them in scalable and resilient production systems.
  5. Flex your interpersonal skills to translate the complexity of your work into tangible business goals.

Skills

Required

  • Degree in a quantitative field (Statistics, Economics, Operations Research, Analytics, Mathematics, Computer Science, or a related field)
  • Experience performing data analytics
  • Experience leveraging open-source programming languages for large-scale data analysis
  • Experience working with machine learning
  • Experience utilizing relational databases

Nice to have

  • PhD in a STEM field
  • AWS experience
  • Python, Scala, or R for large-scale data analysis
  • SQL

What the JD emphasized

  • customization
  • LLM
  • GenAI
  • NLP
  • Large Language Models (LLMs)
  • adapt and finetune
  • training
  • evaluation
  • validation
  • operationalize
  • training language models
  • large computer vision models
  • delivering models at scale

Other signals

  • LLM Customization
  • Generative AI
  • NLP
  • Production Systems
  • Data Science Management