Analytics Engineer III

Box Box · Enterprise · Warsaw, Poland · Business Analytics

This role focuses on building and maintaining data pipelines and infrastructure for a cloud cost management platform within Box. It involves working with large datasets, ensuring data delivery, and supporting analytics and data science teams. The role emphasizes data engineering best practices and utilizing GCP and big data tools.

What you'd actually do

  1. Build and own data pipelines that clean, transform, and aggregate data from disparate sources
  2. Create and maintain optimal data pipeline architecture
  3. Assemble large, complex data sets that meet functional / non-functional business requirements
  4. Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using GCP BigQuery and Spark
  5. Build analytics tools that utilize the data pipeline to provide actionable insights into operational efficiency and other key business performance metrics

Skills

Required

  • 3+ Years of relevant industry or relevant academia experience working with large amounts of data
  • Experience building and optimizing scalable data pipelines, architectures and data sets
  • Expert in SQL
  • Experience with at least one of the programming languages: Scala, Java
  • Experience with scripting language: Python, NodeJS
  • Experience with GCP (BigQuery, Dataproc, Dataflow/Fusion)
  • Experience with big data tools: Hadoop, Spark, Kafka, etc
  • Strong analytic skills related to working with structured and unstructured datasets
  • Experience supporting and working with cross-functional teams in a dynamic environment

Nice to have

  • Be cognizant of emerging technology trends and find adoption opportunities to improve existing development processes
  • Familiarity with Virtualization/container abstractions and orchestration (Kubernetes, Docker, etc.)
  • Familiarity with Visualization software: Tableau
  • Familiarity with frontend web framework: React