Data Platform Engineer, Fauna

Amazon Amazon · Big Tech · NY +1 · Business Intelligence

Data Platform Engineer for Fauna Robotics, focused on building foundational data systems for robotics and ML development. This role involves designing and implementing infrastructure for data collection, storage, processing, and transformation from robot sensors, video, and logs, enabling ML training and fleet performance monitoring.

What you'd actually do

  1. Design and build scalable data pipelines for ingesting and processing robotics data (sensor streams, video, telemetry, logs)
  2. Develop and maintain data storage solutions optimized for diverse data types and access patterns
  3. Create tools and APIs for researchers and engineers to efficiently query and analyze large datasets
  4. Build real-time data processing systems for monitoring robot fleet performance
  5. Build and maintain data transformation pipelines that prepare robotics data for ML training

Skills

Required

  • Data engineering
  • Data modeling
  • Data warehousing
  • ETL pipeline development
  • SQL
  • Python
  • Java
  • Scala
  • NodeJS
  • Mentoring

Nice to have

  • Hadoop
  • Hive
  • Spark
  • EMR
  • Big data technologies
  • Large data warehouse operation

What the JD emphasized

  • 5+ years of data engineering experience
  • Experience with data modeling, warehousing and building ETL pipelines
  • Experience with SQL
  • Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS
  • Experience mentoring team members on best practices

Other signals

  • data pipelines for robotics data
  • data storage solutions
  • data transformation pipelines for ML training
  • real-time data processing