Data Engineer II

Redfin · Seattle, WA

Data Engineer II role focused on building and maintaining data pipelines and integrations for Redfin's Data Platform. Requires experience with SQL, data modeling, and various data pipeline technologies. The role involves working with engineering and analytics teams to support data needs across multiple systems.

What you'd actually do

  1. Design, develop, and support complex data pipelines from a wide variety of data sources for Redfin
  2. Work with other Engineering teams to design and implement data exchange systems from structured and unstructured data sources
  3. Work with end users to gather requirements and design solutions that provide the data they need in the format they need it in
  4. Ensure 100% accuracy in everything you do: make sure that all code and processes are fully tested and verified before running in production
  5. Identify and implement internal process improvements by automating manual processes and optimizing for accuracy, performance, and coding best practices

Skills

Required

  • 3+ years of experience with a modern programming language (Python, Java, etc.)
  • 3+ years of experience with complex SQL, data modeling, and database technologies (Postgres, MySQL, Snowflake, etc.)
  • Experience creating and maintaining complex data pipelines
  • Experience with batch and streaming data pipelines
  • Experience with both structured and unstructured data sets
  • Ability to translate business needs into technical requirements
  • Ability to structure data to make it accessible, performant, and resilient to change
  • Ability to handle ambiguity and function in a fast-paced, constantly changing environment
  • Ability to communicate well with both technical and non-technical users

Nice to have

  • Python is preferred
  • Snowflake is preferred
  • Git / Jenkins / Airflow / Docker / Kubernetes / Spark experience is a plus
  • API development is a plus
  • Cloud services (AWS / Azure / GCP) experience is a plus
  • AWS is preferred
  • Message queuing and stream processing experience is a plus