Lead Data Engineer (Python, AWS, Spark, SQL, Snowflake, Databricks, GenAI)

Capital One · Banking · McLean, VA +1

Lead Data Engineer role focused on building data pipelines and platforms for personalized marketing and messaging within a fintech company. Requires experience with Python, SQL, Spark, cloud platforms (AWS), and data warehousing (Snowflake, Databricks).

What you'd actually do

  1. Collaborate with and across Agile teams to design, develop, test, implement, and support technical solutions using data movement tools and technologies
  2. Work as a lead developer with a team of developers who have deep experience in data movement, distributed computing, and full stack systems
  3. Use programming languages such as Python and SQL, open-source RDBMS and NoSQL databases, and cloud-based data warehousing services such as Snowflake and Databricks
  4. Optimize information systems for end users and downstream application consumers by using sound data design practices
  5. Share your passion for staying on top of tech trends, experimenting with and learning new technologies, participating in internal & external technology communities, and mentoring other members of the engineering community

Skills

Required

  • Bachelor's Degree
  • 4 years of experience in application development
  • 2 years of experience in big data technologies
  • 1 year of experience with cloud computing (AWS, Microsoft Azure, Google Cloud)

Nice to have

  • Master's Degree
  • Experience using AI-assisted coding tools (Claude Code, GitHub Copilot) to accelerate software delivery
  • 7+ years of experience in application development including Python, SQL, Spark, ETL tools, AWS Glue
  • 4+ years of experience with a public cloud (AWS, Microsoft Azure, Google Cloud)
  • 4+ years of experience with distributed data/computing tools (MapReduce, Hadoop, Hive, EMR, Kafka, Spark, or MySQL)
  • 4+ years of experience working on real-time data and streaming applications
  • 4+ years of experience with NoSQL implementation (Mongo, Cassandra)
  • 4+ years of data warehousing experience (Redshift or Snowflake)
  • 4+ years of experience with UNIX/Linux including basic commands and shell scripting
  • 4+ years of experience with data modeling for data warehousing
  • 2+ years of experience with Agile engineering practices

What the JD emphasized

  • GenAI