Senior Data Science Engineer

Adobe · Enterprise · Noida, India

Senior Data Engineer role focused on building and operating large-scale analytical data pipelines and analytics-ready data tables on Azure Databricks using Spark and Python. The role supports product analytics, usage metrics, and experimentation, with selective use of GenAI/LLM capabilities.

What you'd actually do

  1. Design, build, and maintain large-scale, production-grade data pipelines on Azure Databricks using Apache Spark and Python
  2. Write and optimize complex, high-performance SQL for data transformation, aggregation, and analytics workloads at scale
  3. Develop and maintain analytics-ready data models (fact tables, dimensions, rollups, metric layers, and gold layer tables) used for dashboards and reporting
  4. Optimize Databricks workloads for performance, reliability, and cost efficiency, including tuning Spark jobs and Delta Lake tables
  5. Establish and apply Delta Lake best practices, including incremental and idempotent processing, MERGE patterns, partitioning, and table optimization

Skills

Required

  • SQL
  • Python
  • Spark
  • Delta Lake
  • Azure Databricks
  • data modeling
  • data warehousing
  • Azure
  • lakehouse architectures

Nice to have

  • streaming data pipelines
  • near-real-time data pipelines
  • MLOps
  • LLM deployment
  • GenAI-enabled data applications
  • BI tools
  • Tableau
  • Power BI

What the JD emphasized

  • Databricks-based analytics data warehouse
  • SQL
  • Python
  • Spark
  • Delta Lake
  • large-scale
  • production-grade data pipelines
  • complex, high-performance SQL
  • data models optimized for analytics and BI consumption
  • Microsoft Azure
  • lakehouse architectures