Lead Data Engineer (enterprise Platforms Technology)

Capital One Capital One · Banking · Plano, TX +1

Lead Data Engineer role focused on building and pioneering technology solutions within Capital One's Enterprise Platforms Technology group. The role involves collaborating with Agile teams, designing and developing technical solutions, working with machine learning and distributed microservices teams, and utilizing various programming languages and cloud-based data warehousing services. The position requires experience in application development, big data technologies, and cloud computing, with preferred qualifications in specific technologies like Spark, Kafka, Snowflake, and real-time data streaming.

What you'd actually do

  1. Collaborate with and across Agile teams to design, develop, test, implement, and support technical solutions in full-stack development tools and technologies
  2. Work with a team of developers with deep experience in machine learning, distributed microservices, and full stack systems
  3. Utilize programming languages like Java, Scala, Python and Open Source RDBMS and NoSQL databases and Cloud based data warehousing services such as Redshift and Snowflake
  4. Share your passion for staying on top of tech trends, experimenting with and learning new technologies, participating in internal & external technology communities, and mentoring other members of the engineering community
  5. Collaborate with digital product managers, and deliver robust cloud-based solutions that drive powerful experiences to help millions of Americans achieve financial empowerment

Skills

Required

  • Bachelor’s Degree
  • 4 years of experience in application development
  • 2 years of experience in big data technologies
  • 1 year experience with cloud computing (AWS, Microsoft Azure, Google Cloud)

Nice to have

  • 7+ years of experience in application development including Python, SQL, Scala, or Java
  • 4+ years of experience with a public cloud (AWS, Microsoft Azure, Google Cloud)
  • 4+ years experience with Distributed data/computing tools (MapReduce, Hadoop, Hive, EMR, Kafka, Spark, Gurobi, or MySQL)
  • 4+ year experience working on real-time data and streaming applications
  • 4+ years of experience with NoSQL implementation (Mongo, Cassandra)
  • 4+ years of data warehousing experience (Redshift, Databricks or Snowflake)
  • 2+ years of experience with Agile engineering practices
  • Experience leveraging interactive AI tooling to accelerate productivity, utilizing capabilities beyond basic code completion

What the JD emphasized

  • big data technologies
  • cloud computing
  • real-time data and streaming applications
  • NoSQL implementation
  • data warehousing experience