Advanced Data Engineer - Gcp

Honeywell Honeywell · Industrial · Bengaluru, Karnataka, India

This role focuses on designing, implementing, and managing data architecture, systems, and processes for collecting, storing, processing, and analyzing high-volume, high-dimensional data. The primary responsibilities include creating and maintaining scalable data pipelines, data warehouses, and data lakes, ensuring data quality and availability, and enforcing data governance and security policies. The role involves working with product owners to define data requirements, collaborating with data scientists and analysts, and developing ETL processes. Experience with cloud platforms (GCP, Azure), databases (Snowflake, Oracle, BigQuery), data modeling, and scripting languages (Python, SQL, PySpark) is required.

What you'd actually do

  1. Work in complex data science and analytics projects in support of the VECE organization
  2. Work with product owner to identify the data requirements and design/ maintain/ optimize data pipeline to ingest, transform, and load structured and unstructured data from various sources into the data warehouse or data lake
  3. Design and implement data models and schemas to support analytical and reporting requirements
  4. Collaborate with data scientists & analysts to define and structure data for effective analysis & reporting
  5. Develop and maintain ETL (Extract, Transform, Load) processes

Skills

Required

  • Data Engineering
  • ETL Development
  • Database Administration
  • Snowflake
  • Oracle
  • Big Query
  • Data Modelling
  • Schema Design
  • Azure Databricks
  • CI/CD
  • Dev Ops Process
  • Google Cloud
  • Azure
  • Python
  • SQL
  • PySpark
  • Structured data
  • Unstructured data
  • Agile development methodology

Nice to have

  • NoSQL system (HBase, Cassandra, MongoDB)
  • Data integration tools
  • SciKit
  • TensorFlow
  • Pytorch
  • GPT
  • Bit bucket
  • Technical vision communication
  • Mentoring ability
  • Effective communication skills

What the JD emphasized

  • 5 to 8 years of relevant experience in Data Engineering, ETL Development, Database Administration.
  • Experience in Snowflake, Oracle, Big Query
  • Experience in Azure Databricks, CI/CD & Dev Ops Process
  • Experience in Google Cloud, Azure, CI/CD & Dev Ops Process
  • Expert in scripting and querying languages, such as Python, SQL, PySpark
  • Experience with both Structured and Unstructured data