Director, Data Scientist - Generative AI Systems

Capital One Capital One · Banking · McLean, VA +1

Capital One is seeking a Director, Data Scientist to lead the Generative AI Systems team. This role involves building and operationalizing AI-powered products, specifically focusing on LLMs for customer-facing applications in dialogue, summarization, comprehension, speech, and image processing. The position requires leading a team of specialists, experimenting with generative AI, and contributing to research. The role emphasizes partnering with cross-functional teams, leveraging technologies like PyTorch, AWS, Hugging Face, LangChain, and VectorDBs, and managing the full ML lifecycle from design to production for over 80 million customers.

What you'd actually do

  1. Partner with a cross-functional team of data scientists, software engineers, machine learning engineers and product managers to deliver AI powered products that change how customers interact with their money.
  2. Leverage a broad stack of technologies — Pytorch, AWS Ultraclusters, Hugging Face, LangChain, Lightning, VectorDBs, and more — to reveal the insights hidden within huge volumes of numeric and textual data.
  3. Be the expert in Natural Language Processing (NLP) to harness the power of Large Language Models (LLMs), adapt and finetune them for customer facing applications and features.
  4. Build machine learning and NLP models through all phases of development, from design through training, evaluation, and validation; partnering with engineering teams to operationalize them in scalable and resilient production systems that serve 80+ million customers.
  5. Flex your interpersonal skills to translate the complexity of your work into tangible business goals.

Skills

Required

  • Bachelor's Degree in a quantitative field plus 9 years of experience performing data analytics OR Master's Degree in a quantitative field or MBA with a quantitative concentration plus 7 years of experience performing data analytics OR PHD in a quantitative field plus 4 years of experience performing data analytics
  • At least 4 years of experience leveraging open source programming languages for large scale data analysis
  • At least 4 years of experience working with machine learning
  • At least 4 years of experience utilizing relational databases

Nice to have

  • PhD in “STEM” field plus 5 years of experience in data analytics
  • At least 1 year of experience working with AWS
  • At least 3 years of experience managing people
  • At least 5 years of experience in Python, Scala, or R for large scale data analysis
  • At least 5 years of experience with machine learning

What the JD emphasized

  • customer facing applications
  • production systems
  • 80+ million customers

Other signals

  • Generative AI
  • LLMs
  • customer-facing applications
  • production systems