Sr Advanced Data Engr

Honeywell Honeywell · Industrial · Hyderabad, Telangana, India

Senior Advanced Data Engineer with 9+ years of experience to architect, build, and scale cloud-native data and AI platforms on Azure using Databricks. This role requires deep expertise in data engineering, lakehouse architecture, and AI/ML data pipelines to enable advanced analytics, machine learning, and business intelligence use cases. The candidate will lead enterprise-scale data initiatives, collaborate with data scientists and ML engineers, and contribute to the data and AI strategy.

What you'd actually do

  1. Architect, design, and lead the development of end‑to‑end data pipelines on Azure using Databricks (Spark / PySpark)
  2. Own the design and evolution of lakehouse architecture using Azure Data Lake Storage (ADLS Gen2) and Delta Lake
  3. Build, optimize, and scale batch and streaming pipelines for large‑volume, high‑velocity datasets
  4. Design and manage feature engineering pipelines and curated datasets for AI/ML model training, validation, and inference
  5. Partner closely with Data Scientists and ML Engineers to enable scalable, production‑ready ML workflows

Skills

Required

  • Python
  • PySpark
  • Spark SQL
  • Databricks
  • Azure Data Lake Storage (ADLS Gen2)
  • Azure Databricks
  • Azure Data Factory
  • Synapse Pipelines
  • Delta Lake
  • SQL
  • CI/CD
  • Git
  • DevOps

Nice to have

  • Azure Machine Learning
  • Databricks ML
  • Feature Store
  • MLflow
  • experiment tracking
  • Kafka
  • Azure Event Hubs
  • Spark Structured Streaming
  • dbt
  • Unity Catalog
  • enterprise data governance tools
  • Power BI
  • MLOps
  • Technical Lead
  • Principal Engineer
  • Architecture Owner
  • LangChain
  • Agent
  • Agent Architecture

What the JD emphasized

  • 9+ years of hands‑on experience in Data Engineering, Data Platform, or Big Data roles
  • Deep expertise in Python, PySpark, and Spark SQL
  • Extensive, real‑world experience with Databricks
  • Strong experience with Azure cloud services
  • Expert‑level understanding of Delta Lake
  • Proven experience designing AI/ML data pipelines (training, validation, inference datasets)
  • Strong understanding of lakehouse, data warehousing, and dimensional modeling concepts
  • Hands‑on experience with CI/CD pipelines, Git, and DevOps practices for data platforms
  • Excellent troubleshooting, diagnostics, and performance tuning skills
  • Strong communication and stakeholder collaboration abilities

Other signals

  • architect, build, and scale cloud-native data and AI platforms on Azure using Databricks
  • lead complex, enterprise-scale data initiatives
  • work closely with data scientists and ML engineers
  • shaping the organization’s data and AI strategy
  • Design and manage feature engineering pipelines and curated datasets for AI/ML model training, validation, and inference
  • Partner closely with Data Scientists and ML Engineers to enable scalable, production-ready ML workflows
  • Support and integrate with MLOps pipelines
  • Lead optimization of Databricks workloads for performance, scalability, reliability, and cost efficiency
  • Define and implement data quality, validation, monitoring, and observability frameworks
  • Enforce data security, governance, and compliance
  • Mentor and technically guide senior, mid-level, and junior data engineers
  • Lead architectural decision-making and contribute to long-term data platform and AI roadmap planning
  • Act as a technical authority and escalation point for complex data engineering challenges