Lead Data Engineer

Capital One Capital One · Banking · Plano, TX

Lead Data Engineer role at Capital One focused on designing and building enterprise-level, scalable, low-latency, fault-tolerant streaming data platforms and pipelines using big data technologies like Flink, Spark, and Kafka. The role involves leading a team of engineers, influencing best practices, and collaborating with cross-functional teams to deliver cloud-based solutions. Requires experience in application development, big data technologies, and cloud computing.

What you'd actually do

  1. Lead design and build Enterprise Level scalable, low-latency, fault-tolerant streaming data platform that provides meaningful and timely insights
  2. Build the next generation Distributed Streaming Data Pipelines and Analytics Data Stores using streaming frameworks (Flink, Spark Streaming) using programming languages like Java, Scala, Python
  3. Lead a group of engineers building data pipelines using big data technologies (Spark, Flink, Kafka, Snowflake, AWS Big Data Services, Snowflake, Redshift) on medium to large scale datasets
  4. Influence best practices for Data Pipeline design, Data architecture and processing of structured and unstructured data.
  5. Work in a creative & collaborative environment driven by agile methodologies with focus on CI/CD, Application Resiliency Standards, and partnership with Cyber & Security teams

Skills

Required

  • application development
  • big data technologies
  • cloud computing

Nice to have

  • Java
  • Python
  • SQL
  • Scala
  • public cloud (AWS, Microsoft Azure, Google Cloud)
  • Distributed data computing tools (Flink, Kafka, Spark)
  • real-time data and streaming applications
  • NoSQL implementation (DynamoDB, OpenSearch)
  • data warehousing experience (Redshift or Snowflake)
  • UNIX/Linux
  • shell scripting
  • Agile engineering practices