Principal Data Scientist - AI Foundations, Specialist Models

Capital One Capital One · Banking · McLean, VA +2

The Entity Resolution Systems team builds and ships state-of-the-art machine learning solutions for entity resolution within the enterprise, impacting marketing and customer servicing. The role involves partnering with cross-functional teams to build a next-generation entity resolution solution using cutting-edge ML, including transformers and graph ML, and leveraging agentic AI tools and workflows. The ideal candidate is innovative, creative, technical, statistically-minded, and a data guru.

What you'd actually do

  1. Partner with a cross-functional team of data scientists, software engineers, and product managers to build a next-generation entity resolution solution that leverages cutting-edge machine learning approaches to solve real-world business problems
  2. Leverage a broad stack of technologies — Python, AWS, Spark, modern state of the art models like transformers, graph ML, and more — to reveal the insights hidden within huge volumes of numeric and textual data
  3. Build machine learning models through all phases of development, from design through training, evaluation, validation, and implementation
  4. Leverage Agentic AI tools and workflows to build and test
  5. Flex your interpersonal skills to translate the complexity of your work into tangible business goals

Skills

Required

  • Python
  • AWS
  • Spark
  • SQL
  • Machine Learning
  • Data Analytics
  • Statistics
  • Economics
  • Operations Research
  • Mathematics
  • Computer Science

Nice to have

  • Scala
  • R
  • clustering
  • classification
  • sentiment analysis
  • time series
  • deep learning

What the JD emphasized

  • entity resolution
  • agentic software development practices
  • deep learning
  • transformer architectures
  • graph approaches
  • agentic AI tools and workflows

Other signals

  • entity resolution
  • transformer architectures
  • graph approaches
  • agentic software development