Senior Software Development Engineer, Big Data

Zillow Zillow · Consumer · United States · Remote

Senior Software Development Engineer on the Housing Trends Data Engineering team at Zillow. This role focuses on building and maintaining data pipelines and systems to process large-scale datasets for housing market indicators, forecasts, and insights. The engineer will collaborate with data science and ML teams, implement feature stores, and support the deployment of ML models into production.

What you'd actually do

  1. Build data systems to link raw data and concrete Housing Trends insights.
  2. Build and extend our internal forecasting model development framework.
  3. Develop and maintain scalable data products and deliver pipelines built for speed, accuracy, and consistency that will scale as our stakeholders’ need.
  4. Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability and performance, etc.
  5. Collaborate with data scientists to understand their data requirements and build systems/tools that enable efficient model training and experimentation.

Skills

Required

  • Python programming
  • big data ecosystems, architectures and modern data platforms
  • AWS cloud data services
  • Lakehouse/Lakebase platforms
  • design, build, and orchestrate batch and real-time data pipelines
  • processing large-scale datasets
  • SparkSQL
  • Spark Streaming
  • PostgreSQL
  • EKS
  • Kubernetes
  • Agile/DevOps software development processes
  • GitLab
  • CI/CD
  • data science concepts
  • fundamental machine learning algorithms/libraries
  • development lifecycle
  • root cause analysis
  • customer engagement
  • cross-functional teams
  • technical concepts to technical and non-technical stakeholders
  • problem-solving skills
  • fast-paced, collaborative environment
  • 5+ years of experience in Big Data Engineering or Machine Learning Engineering roles

Nice to have

  • Databricks
  • LLM and Agentic AI technologies and engineering patterns

What the JD emphasized

  • A good understanding of data science concepts, fundamental machine learning algorithms/libraries, and development lifecycle is a must-to-have.