Advanced Software Engr

Honeywell Honeywell · Industrial · India

Senior Data Engineer with 6+ years of experience to design, build, and scale cloud-native data and AI platforms on Azure using Databricks. The role requires strong hands-on expertise in data engineering, lakehouse architecture, and AI/ML data pipelines to support advanced analytics, machine learning, and business intelligence use cases. The ideal candidate will lead complex data initiatives, collaborate closely with data scientists and ML engineers, and play a key role in shaping the organization’s data and AI strategy.

What you'd actually do

  1. Architect and develop end‑to‑end data pipelines on Azure using Databricks (Spark / PySpark)
  2. Design and maintain lakehouse architectures using Azure Data Lake + Delta Lake
  3. Build and optimize batch and streaming pipelines for large‑scale datasets
  4. Create and manage feature pipelines and curated datasets for AI/ML model training and inference
  5. Collaborate with data scientists and ML engineers to enable scalable ML workflows

Skills

Required

  • Python
  • PySpark
  • Spark SQL
  • Databricks
  • Azure Cloud services
  • Azure Data Lake Storage (ADLS Gen2)
  • Azure Databricks
  • Azure Data Factory / Synapse Pipelines
  • Delta Lake
  • Advanced SQL
  • AI/ML data pipelines
  • data warehousing
  • lakehouse architecture
  • dimensional modeling
  • CI/CD
  • Git
  • DevOps practices
  • troubleshooting
  • performance tuning
  • problem-solving
  • LangChain
  • Agent
  • Agent Architecture

Nice to have

  • ML platforms such as Azure Machine Learning or Databricks ML
  • Feature Store
  • MLflow
  • experiment tracking
  • Streaming data experience (Kafka, Event Hubs, Spark Structured Streaming)
  • dbt
  • Unity Catalog
  • data governance tools
  • BI and visualization tools (Power BI preferred)
  • MLOps best practices
  • production ML systems
  • technical lead
  • mentor

What the JD emphasized

  • AI/ML data pipelines
  • MLOps pipelines
  • data quality, validation, monitoring, and observability frameworks
  • data security, governance, and compliance
  • LangChain , Agent, Agent Architecture

Other signals

  • design and build scalable cloud-native data and AI platforms
  • lead complex data initiatives
  • shape the organization’s data and AI strategy
  • architect and develop end-to-end data pipelines
  • create and manage feature pipelines and curated datasets for AI/ML model training and inference
  • collaborate with data scientists and ML engineers to enable scalable ML workflows
  • support MLOps pipelines
  • implement data quality, validation, monitoring, and observability frameworks
  • ensure data security, governance, and compliance