Data & Service Engineer

Microsoft Microsoft · Big Tech · Barcelona, CT +1 · Data Engineering

Data Engineer role focused on building and operating data ingestion, transformation, and validation pipelines that power experimentation, insights, and AI features for SwiftKey. The role involves working with large-scale data systems in cloud environments, ensuring data quality, and supporting AI feature development.

What you'd actually do

  1. Build and operate production services that ingest, validate, transform, and serve data in cloud-hosted environments (we use containerized and serverless services in Azure).
  2. Design, maintain, and improve data and model infrastructure used to process, store, distribute, and access large datasets, ensuring availability and correctness (we use modern data lake storage and platforms such as Databricks, Azure Synapse, and Spark)
  3. Monitor the health and performance of live data services using telemetry and alerts, investigate service issues, and participate in incident response [and on-call rotations] to reduce disruption for users and downstream systems.
  4. Partner with other engineers, product managers, and applied scientists to deliver high-quality data and production-ready services that support analytics, experimentation, and AI feature development.
  5. Apply security, privacy, and compliance standards across pipelines and services, managing data access and ensuring adherence to applicable policies and regulations.

Skills

Required

  • Python
  • SQL
  • distributed data processing platforms
  • Apache Spark
  • Databricks
  • Azure Synapse
  • large-scale data storage
  • data lake architectures
  • cloud platforms
  • Azure
  • containerized workloads
  • monitoring system health
  • distributed cloud services

Nice to have

  • event-driven data ingestion systems
  • streaming data ingestion systems
  • Azure Event Hubs
  • Kafka
  • large-scale consumer data
  • data governance
  • data compliance
  • data security practices

What the JD emphasized

  • production data pipelines
  • large-scale data storage
  • cloud platforms
  • service engineering practices
  • security, privacy, and compliance standards

Other signals

  • data pipelines
  • data processing
  • AI features
  • ML data engineering