Advanced Data Engr

Honeywell Honeywell · Industrial · Hyderabad, Telangana, India

Senior Data Engineer to design, build, and scale cloud-native data and AI platforms on Azure using Databricks, focusing on data engineering, lakehouse architecture, and AI/ML data pipelines to support advanced analytics and ML use cases. The role involves leading data initiatives, collaborating with data scientists and ML engineers, and contributing to the data and AI strategy.

What you'd actually do

  1. Architect and develop end‑to‑end data pipelines on Azure using Databricks (Spark / PySpark)
  2. Design and maintain lakehouse architectures using Azure Data Lake + Delta Lake
  3. Build and optimize batch and streaming pipelines for large‑scale datasets
  4. Create and manage feature pipelines and curated datasets for AI/ML model training and inference
  5. Collaborate with data scientists and ML engineers to enable scalable ML workflows

Skills

Required

  • Python
  • PySpark
  • Spark SQL
  • Databricks
  • Azure Data Lake Storage (ADLS Gen2)
  • Azure Databricks
  • Azure Data Factory / Synapse Pipelines
  • Delta Lake
  • SQL
  • Data Warehousing
  • Lakehouse Architecture
  • Dimensional Modeling
  • CI/CD
  • Git
  • DevOps
  • Troubleshooting
  • Performance Tuning
  • Problem-solving
  • DAG based workflows

Nice to have

  • Azure Machine Learning
  • Databricks ML
  • Feature Store
  • MLflow
  • Experiment Tracking
  • Kafka
  • Event Hubs
  • Spark Structured Streaming
  • dbt
  • Unity Catalog
  • Data Governance
  • Power BI
  • MLOps
  • Technical Lead
  • Mentor
  • LangChain
  • Agent
  • Agent Architecture

What the JD emphasized

  • 6+ years of hands‑on experience in Data Engineering or Data Platform roles
  • Strong proficiency in Python, PySpark, and Spark SQL
  • Extensive experience with Databricks (jobs, notebooks, workflows, Delta Live Tables)
  • Strong experience with Azure Cloud services
  • Solid understanding of Delta Lake, including optimization and ACID guarantees
  • Advanced SQL skills for analytical data modeling
  • Experience designing AI/ML data pipelines (training, validation, inference datasets)
  • Develop, orchestrate and maintain scalable data pipelines using DAG based workflows to ensure reliable and efficient data processing

Other signals

  • design and build cloud-native data and AI platforms
  • support advanced analytics, machine learning, and business intelligence use cases
  • lead complex data initiatives
  • collaborate closely with data scientists and ML engineers
  • shaping the organization’s data and AI strategy