Software Developer I - Data Platform

Redfin Redfin · Seattle · Seattle, WA

Early-career Software Developer focused on building and maintaining data pipelines and infrastructure for a data platform. The role involves ingesting, processing, and delivering large-scale datasets using technologies like Spark, Python/Java, and Airflow, supporting machine learning, analytics, and product teams. It emphasizes learning distributed systems and data engineering within a cloud data platform environment.

What you'd actually do

  1. Build and maintain scalable data pipelines that ingest, process, and organize large datasets (such as listings, clickstream, and external data sources) into Redfin’s data lake and analytics platforms.
  2. Develop data processing applications using technologies such as Spark, Python/Java, and workflow orchestration tools (e.g., Airflow/Windfarm) to support machine learning, product features, and analytics.
  3. Collaborate with senior engineers and tech leads to improve the reliability, performance, and monitoring of Redfin’s data platform and batch processing infrastructure.
  4. Help modernize legacy data systems by migrating pipelines and datasets to Redfin’s lakehouse architecture and standardized data frameworks.
  5. Contribute to internal tools and services that enable other engineering teams to discover, access, and use data efficiently across Redfin’s platform.

Skills

Required

  • 1-3 years of experience building software systems, data pipelines, or backend services
  • Experience working with relational or non-relational databases
  • Experience with Spark
  • Experience with Python/Java
  • Experience with SQL
  • Experience with Airflow
  • Experience with cloud infrastructure

Nice to have

  • distributed systems
  • data engineering
  • modern data platforms
  • lakehouse architecture