Lead Data Engineer

Capital One Capital One · Banking · Plano, TX

Lead Data Engineer role at Capital One, focusing on building and managing scalable, low-latency data platforms and pipelines using big data technologies. The role involves leading a team, influencing best practices, and collaborating with various stakeholders to deliver cloud-based solutions. While the role works with teams experienced in machine learning, its core function is data engineering, not direct AI/ML model development.

What you'd actually do

  1. Lead design and build Enterprise Level scalable, low-latency, fault-tolerant, well governed, well managed data platforms and data processing applications that provides meaningful and timely insights
  2. Build the next generation of data driven Governance Controls, Distributed Data Pipelines and Analytics Data Stores using programming languages like Python, Java, Typescript, React
  3. Lead a group of engineers building data pipelines using big data technologies (Databricks, Snowflake, Spark, Kafka, AWS Big Data Services, Redshift) on medium to large scale datasets
  4. Influence best practices for Data Pipeline design, Data architecture and processing of structured and unstructured data.
  5. Work in a creative & collaborative environment driven by agile methodologies with focus on CI/CD, Application Resiliency Standards, and partnership with Cyber & Security teams

Skills

Required

  • Bachelor's Degree
  • 4 years of experience in application development
  • 2 years of experience in big data technologies
  • 1 year experience with cloud computing (AWS, Microsoft Azure, Google Cloud)

Nice to have

  • 7+ years of experience in application development including Java, Python, SQL, or Scala
  • 4+ years of experience with a public cloud (AWS, Microsoft Azure, Google Cloud)
  • 4+ years experience with Distributed data computing tools (Flink, Kafka, Spark)
  • 4+ year experience working on real-time data and streaming applications
  • 4+ years of experience with NoSQL implementation (DynamoDB, OpenSearch)
  • 4+ years of data warehousing experience (Redshift or Snowflake)
  • 4+ years of experience with UNIX/Linux including basic commands and shell scripting
  • 2+ years of experience with Agile engineering practices