What you'd actually do

Work in complex data science and analytics projects in support of the VECE organization

Work with product owner to identify the data requirements and design/ maintain/ optimize data pipeline to ingest, transform, and load structured and unstructured data from various sources into the data warehouse or data lake

Design and implement data models and schemas to support analytical and reporting requirements

Collaborate with data scientists & analysts to define and structure data for effective analysis & reporting

Develop and maintain ETL (Extract, Transform, Load) processes

Skills

Required

Data Engineering
ETL Development
Database Administration
Snowflake
Oracle
Big Query
Data Modelling
Schema Design
Azure Databricks
CI/CD
Dev Ops Process
Google Cloud
Azure
Python
SQL
PySpark
Structured data
Unstructured data
Agile development methodology

Nice to have

NoSQL system (HBase, Cassandra, MongoDB)
Data integration tools
SciKit
TensorFlow
Pytorch
GPT
Bit bucket
Technical vision communication
Mentoring ability
Effective communication skills

What the JD emphasized

5 to 8 years of relevant experience in Data Engineering, ETL Development, Database Administration.

Experience in Snowflake, Oracle, Big Query

Experience in Azure Databricks, CI/CD & Dev Ops Process

Experience in Google Cloud, Azure, CI/CD & Dev Ops Process

Expert in scripting and querying languages, such as Python, SQL, PySpark

Experience with both Structured and Unstructured data

Advanced Data Engineer (B3) - (ML/AI)

Position Description:** **

Honeywell's Value Engineering (VE) and Component Engineering (CE) Center of Excellence (COE) is a dynamic collective of professionals dedicated to refining product development through innovative engineering and strategic component selection.

You will be part of Honeywell's VE/CE CoE Advanced Tech team. In this role as Sr Advanced Data Engineer, you will design, implement, and manage the data architecture, systems, and processes to effectively collect, store, process and analyze high volume, high dimensional data to provide strategic insight into complex business problems. This will involve creating and maintaining scalable, efficient, and secure data pipelines, data warehouses, and data lakes. You need to ensure consistency in data quality and availability for analysis and reporting including compliance with data governance and security

Key Responsibilities:

Work in complex data science and analytics projects in support of the VECE organization
Work with product owner to identify the data requirements and design/ maintain/ optimize data pipeline to ingest, transform, and load structured and unstructured data from various sources into the data warehouse or data lake
Design and implement data models and schemas to support analytical and reporting requirements
Collaborate with data scientists & analysts to define and structure data for effective analysis & reporting
Develop and maintain ETL (Extract, Transform, Load) processes
Administer, optimize, and manage databases, data warehouses, and data lakes to ensure performance, reliability, and scalability
Enforce data governance policies, standards, and best practices to maintain data quality, privacy, and security
Create and maintain comprehensive documentation for data architecture, processes, and systems
Troubleshoot and resolve data-related problems and optimize system performance
Partner with IT support team on production processes, continuous improvement, and production deployments

**YOU MUST HAVE: **

5 to 8 years of relevant experience in Data Engineering, ETL Development, Database Administration.
Experience in Snowflake, Oracle, Big Query
Experience in Data Modelling techniques including schema design for both rational and NoSQL databases
Experience in Azure Databricks, CI/CD & Dev Ops Process
Experience in Google Cloud, Azure, CI/CD & Dev Ops Process
Expert in scripting and querying languages, such as Python, SQL, PySpark
Experience with both Structured and Unstructured data
Knowledge of Agile development methodology

WE VALUE

Working with at least one NoSQL system (HBase, Cassandra, MongoDB)
Knowledge of databases, data warehouse platforms (Big Query, Snowflake) and Cloud based tools.
Experience in using data integration tools for ETL processes.
Knowledge in cutting-edge packages such as SciKit, TensorFlow, Pytorch, GPT, PySpark, Bit bucket etc.
Ability to develop and communicate technical vision for projects and initiatives that can be understood customers and management.
Proven mentoring ability to drive results and technical growth in peers.
Effective communication skills (verbal, written, and presentation) for interacting with customers and peers.

Advanced Data Engineer (B3) - (ML/AI)

Position Description:** **

Key Responsibilities:

Work in complex data science and analytics projects in support of the VECE organization
Work with product owner to identify the data requirements and design/ maintain/ optimize data pipeline to ingest, transform, and load structured and unstructured data from various sources into the data warehouse or data lake
Design and implement data models and schemas to support analytical and reporting requirements
Collaborate with data scientists & analysts to define and structure data for effective analysis & reporting
Develop and maintain ETL (Extract, Transform, Load) processes
Administer, optimize, and manage databases, data warehouses, and data lakes to ensure performance, reliability, and scalability
Enforce data governance policies, standards, and best practices to maintain data quality, privacy, and security
Create and maintain comprehensive documentation for data architecture, processes, and systems
Troubleshoot and resolve data-related problems and optimize system performance
Partner with IT support team on production processes, continuous improvement, and production deployments

**YOU MUST HAVE: **

5 to 8 years of relevant experience in Data Engineering, ETL Development, Database Administration.
Experience in Snowflake, Oracle, Big Query
Experience in Data Modelling techniques including schema design for both rational and NoSQL databases
Experience in Azure Databricks, CI/CD & Dev Ops Process
Experience in Google Cloud, Azure, CI/CD & Dev Ops Process
Expert in scripting and querying languages, such as Python, SQL, PySpark
Experience with both Structured and Unstructured data
Knowledge of Agile development methodology

WE VALUE

Working with at least one NoSQL system (HBase, Cassandra, MongoDB)
Knowledge of databases, data warehouse platforms (Big Query, Snowflake) and Cloud based tools.
Experience in using data integration tools for ETL processes.
Knowledge in cutting-edge packages such as SciKit, TensorFlow, Pytorch, GPT, PySpark, Bit bucket etc.
Ability to develop and communicate technical vision for projects and initiatives that can be understood customers and management.
Proven mentoring ability to drive results and technical growth in peers.
Effective communication skills (verbal, written, and presentation) for interacting with customers and peers.