Lead Data Engineer (intelligent Foundations and Experiences)

Capital One Capital One · Banking · McLean, VA +1

Lead Data Engineer role focused on building and pioneering in the technology space, collaborating with Agile teams to design, develop, test, implement, and support technical solutions in Big Data development tools and technologies. The role involves working with machine learning, distributed microservices, and ETL systems, utilizing languages like Java, Scala, Python, and cloud-based data warehousing services. It also emphasizes staying on top of tech trends, experimenting with new technologies, and mentoring. The role mentions leveraging interactive AI tooling to accelerate productivity.

What you'd actually do

  1. Collaborate with and across Agile teams to design, develop, test, implement, and support technical solutions in Big Data development tools and technologies
  2. Work with a team of developers with deep experience in machine learning, distributed microservices, and ETL systems
  3. Utilize programming languages like Java, Scala, Python and Open Source RDBMS and NoSQL databases and Cloud based data warehousing services such as Redshift and Snowflake
  4. Share your passion for staying on top of tech trends, experimenting with and learning new technologies, participating in internal & external technology communities, and mentoring other members of the engineering community
  5. Collaborate with digital product managers, and deliver robust cloud-based solutions that drive powerful experiences to help millions of Americans achieve financial empowerment

Skills

Required

  • Bachelor's Degree
  • 4 years of experience in application development
  • 2 years of experience in big data technologies
  • 1 year experience with cloud computing (AWS, Microsoft Azure, Google Cloud)

Nice to have

  • 7+ years of experience in application development including Python, SQL, Scala, or Java
  • 4+ years of experience with a public cloud (AWS, Microsoft Azure, Google Cloud)
  • 4+ years experience with Distributed data/computing tools (MapReduce, Hadoop, Hive, EMR, Kafka, Spark, Gurobi, or MySQL)
  • 4+ year experience working on real-time data and streaming applications
  • 4+ years of experience with NoSQL implementation (Mongo, Cassandra)
  • 4+ years of data warehousing experience (Redshift or Snowflake)
  • 4+ years of experience with UNIX/Linux including basic commands and shell scripting
  • 2+ years of experience with Agile engineering practices
  • Experience leveraging interactive AI tooling to accelerate productivity, utilizing capabilities beyond basic code completion

What the JD emphasized

  • big data technologies
  • cloud computing
  • real-time data and streaming applications
  • NoSQL implementation
  • data warehousing experience