Data Platform Engineer - Airflow Platform Specialist

Apple Apple · Big Tech · Hyderabad, India · Software and Services

Seeking an experienced Data Platform Engineer with deep expertise in Apache Airflow to build, scale, and maintain a data orchestration platform. This role focuses on building the infrastructure and tooling for other data engineers, not on building data pipelines themselves. Responsibilities include designing, architecting, and maintaining Airflow infrastructure, customizing core functionality, developing custom operators, building internal frameworks, modifying source code, and contributing to the Airflow open-source community. The role also involves deploying and managing Airflow on Kubernetes, optimizing performance, designing multi-tenancy, building auto-scaling, and architecting high-availability solutions. Additionally, it requires implementing monitoring, alerting, and observability, troubleshooting distributed systems, building self-service capabilities, creating tooling for debugging, and establishing SLAs. Integration with Spark, EMR, Kubernetes, and Hadoop, as well as developing authentication/authorization frameworks and integrating with CI/CD and observability stacks, are key. The role also emphasizes developer experience through documentation, best practices, tooling, and consultation.

What you'd actually do

  1. Design, architect, and maintain highly scalable Apache Airflow platform infrastructure
  2. Deep dive into Airflow internals to customize and extend core functionality
  3. Develop custom Airflow operators, sensors, hooks, and plugins for organizational use
  4. Build internal frameworks and abstractions on top of Airflow to simplify DAG authoring
  5. Modify Airflow source code when necessary to meet specific platform requirements

Skills

Required

  • Apache Airflow internals
  • Python
  • Kubernetes
  • Docker
  • AWS EMR
  • Hadoop ecosystem
  • Apache Spark
  • Distributed systems concepts
  • Database performance tuning
  • CI/CD systems
  • Monitoring and observability
  • Software design principles
  • API design and development

Nice to have

  • Active contributions to Apache Airflow open-source project
  • Experience running Airflow at massive scale
  • Experience building multi-tenant data platforms
  • GitOps
  • SRE or platform reliability engineering
  • Digital advertising or high-scale data platforms

What the JD emphasized

  • deep, hands-on experience with Apache Airflow internals
  • customizing and extending Airflow core
  • modify Airflow source code