Director, Data Scientist

Capital One Capital One · Banking · McLean, VA +1

Director of Data Science for the Generative AI Systems team at Capital One, focusing on building and delivering state-of-the-art generative AI solutions for internal efficiency and customer-facing applications. The role involves leading a team of NLP, speech, and computer vision specialists, experimenting with emerging generative AI technologies, and contributing to research.

What you'd actually do

  1. Partner with a cross-functional team of data scientists, software engineers, and product managers to deliver a product customers love
  2. Leverage a broad stack of technologies — Python, Kubernetes, AWS, H2O, Spark, and more — to reveal the insights hidden within huge volumes of numeric and textual data
  3. Build machine learning models through all phases of development, from design through training, evaluation, validation, and implementation
  4. Flex your interpersonal skills to translate the complexity of your work into tangible business goals

Skills

Required

  • Bachelor's Degree in a quantitative field or equivalent experience
  • 9 years of experience performing data analytics (with Bachelor's)
  • Master's Degree in a quantitative field or MBA with quantitative concentration or equivalent experience
  • 7 years of experience performing data analytics (with Master's)
  • PHD in a quantitative field or equivalent experience
  • 4 years of experience performing data analytics (with PHD)
  • 4 years of experience leveraging open source programming languages for large scale data analysis
  • 4 years of experience working with machine learning
  • 4 years of experience utilizing relational databases

Nice to have

  • PhD in STEM field
  • 5 years of experience in data analytics
  • 1 year of experience working with AWS
  • 3 years of experience managing people
  • 5 years of experience in Python, Scala, or R for large scale data analysis
  • 5 years of experience with machine learning

What the JD emphasized

  • Generative AI
  • dialogue
  • text summarization
  • reading comprehension
  • speech recognition
  • image/document processing
  • time-series sequencing modeling
  • customer-facing applications
  • internal applications
  • data analytics
  • machine learning
  • Python
  • AWS

Other signals

  • Generative AI
  • LLMs
  • NLP
  • speech recognition
  • image/document processing
  • time-series sequencing modeling