Senior Data Engineer, Cybersecurity

Workday Workday · Enterprise · Dublin, Ireland

Senior Data Engineer focused on data logistics, transformation, enrichment, MDM, data quality, cataloging, and governance within a cybersecurity context. The role involves designing and optimizing scalable data pipelines, ETL/ELT workflows, and data enrichment layers, ensuring data usability and integrity for enterprise-wide use. Responsibilities include data documentation, cataloging, MDM, designing downstream data marts, and enforcing data governance policies.

What you'd actually do

  1. Design, implement, and optimize highly scalable, resilient data ingestion and transformation pipelines from disparate internal and external sources.
  2. Lead the strategy for Master Data Management (MDM) within our pipelines. Architect sophisticated data stitching, deduplication, and enrichment processes to establish a single source of truth for core business entities.
  3. Implement comprehensive data quality and observability frameworks (e.g., data profiling, anomaly detection, SLA tracking) to ensure absolute trust in pipeline outputs.
  4. Architect and build optimized, user-focused Data Marts and semantic layers. Translate complex source data into clean, dimensional models (Kimball/Inmon, Star Schema) tailored for business intelligence and analytics.
  5. Serve as the primary champion for data discoverability. Build, maintain, and automate enterprise Data Dictionaries, Data Catalogs, and Data Logs to ensure data lineage and definitions are transparent across the org.

Skills

Required

  • Data pipeline architecture
  • ETL/ELT development
  • Data enrichment
  • Master Data Management (MDM)
  • Data quality frameworks
  • Data observability
  • Data Mart design
  • Semantic layer design
  • Data cataloging
  • Data documentation
  • Data governance
  • AWS Glue
  • AWS EMR
  • Apache Spark (PySpark or Scala)
  • Data modeling (Star/Snowflake schemas, dimensional modeling)
  • Python
  • SQL

Nice to have

  • OpenLineage
  • dbt docs
  • Collibra
  • Alation
  • Apache Atlas
  • AWS Glue Data Catalog
  • Apache Airflow
  • Prefect
  • AWS Step Functions
  • Data Lakehouse architecture
  • Data Mesh
  • Apache Iceberg
  • Delta Lake
  • CI/CD pipelines (GitHub Actions, Jenkins)
  • Automated unit/integration testing

What the JD emphasized

  • 5+ years of hands-on data engineering experience
  • Advanced, hands-on experience with AWS Glue
  • Deep proficiency with Apache Spark
  • Expert knowledge of data warehousing concepts
  • Expert-level Python and SQL