Data Engineer II

Google Google · Big Tech · Boulder, CO +2

Google's gTech Product and Tools Operations team is seeking a Data Engineer II to design, develop, and support data pipelines, warehouses, and reporting systems. This role involves leveraging AI to integrate data into full-stack software solutions, creating robust ETL pipelines, and partnering with stakeholders and data scientists to scale ML models into production. The position requires experience in data pipeline design, dimensional data modeling, and working with data models through exploratory queries.

What you'd actually do

  1. Design, develop, and support data pipelines, warehouses, and reporting systems, leveraging AI to integrate data into full-stack software solutions.
  2. Create robust ETL pipelines and reporting systems for new data while continuously improving existing data models and workflows.
  3. Partner with stakeholders and support engineers to ensure data infrastructure evolves with business, product, and user requirements.
  4. Work closely with data scientists to scale and transition statistical and machine learning models into production pipelines.
  5. Write and clear technical documentation while driving innovation through new analytical tools to unlock deeper data insights.

Skills

Required

  • Bachelor's degree or equivalent practical experience
  • 1 year of experience coding in one or more programming languages
  • 1 year of experience designing data pipelines, and dimensional data modeling for synch and asynch system integration and implementation using internal (e.g., Flume, etc.) and external stacks (DataFlow, Spark, etc.)
  • Experience working with data models by performing exploratory queries and scripts

Nice to have

  • 1 year of experience partnering with stakeholders to deliver projects on time, within budget, and scope
  • Experience writing and maintaining ETLs for structured and unstructured data sources
  • Experience modeling complex business processes or real-world data sources
  • Experience in designing data models, data warehouses, and managing distributed data processing
  • Familiarity with non-relational (NoSQL) data storage systems and Unix/Linux environments
  • Excellent written communication and organizational skills

Other signals

  • integrate data into full-stack software solutions
  • scale and transition statistical and machine learning models into production pipelines