Data Engineer

Lyft Lyft · Consumer · Toronto, ON · SCC Tech

Data Engineer on the Safety and Customer Care team, responsible for data modeling and pipelines powering AI agents and human associates. Focus on data quality, reliability, and efficiency improvements.

What you'd actually do

  1. Owner of the core data pipeline, responsible for scaling up data processing flow to meet the rapid data growth at Lyft
  2. Evolve data model and data schema based on business and engineering needs
  3. Implement systems tracking data quality and consistency
  4. Develop tools supporting self-service data pipeline management (ETL)
  5. SQL and MapReduce job tuning to improve data processing performance

Skills

Required

  • Spark
  • Python
  • SQL performance tuning
  • AWS
  • Hadoop/S3
  • Hive
  • Presto
  • Airflow
  • ETL processes
  • workflow orchestration
  • data warehousing

Nice to have

  • large-scale distributed systems

What the JD emphasized

  • critical infrastructure
  • AI agents
  • huge amounts of Lyft data