Senior Data Architect

Oura Oura · Consumer · Helsinki, Finland · Data Engineering & Analytics

Senior Data Architect role focused on building data foundations for AI products, including LLM reporting and Agentic AI, using Data Mesh principles and cloud platforms like Databricks and Google BigQuery. The role involves designing vector-based architectures and RAG patterns, and implementing data governance for privacy and compliance (HIPAA/PHI).

What you'd actually do

  1. Design and manage data domains to enable the creation of interoperable, trustworthy data products.
  2. Build and optimize Oura’s Data Lakehouse leveraging Databricks, Google BigQuery, and Snowflake to process Terabyte-Petabyte scale data.
  3. Implement federated data governance within the data mesh to ensure processes meet privacy, compliance (HIPAA/PHI), and security requirements.
  4. Partner with Data Engineering, Data Science, and Business Domain owners to advocate for unified analytics and modeling best practices.
  5. Design vector-based data architectures and Retrieval Augmented Generation (RAG) patterns to enable LLM reporting and Agentic AI.

Skills

Required

  • Data architecture
  • Data modeling
  • Cloud-based platforms (AWS, GCP, Databricks or Azure)
  • Databricks
  • Google BigQuery
  • Snowflake
  • Data Mesh principles
  • Federated data governance
  • HIPAA
  • PHI
  • Vector-based data architectures
  • Retrieval Augmented Generation (RAG)
  • Agentic AI
  • Master Data Management (MDM)
  • Reference Data Management (RDM)
  • Iceberg
  • dbt
  • dbt Cloud
  • AI/ML integration
  • VertexAI
  • MLOps frameworks
  • Large Language Models (LLM)
  • Kafka
  • Kinesis
  • Python
  • Spark
  • SQL
  • Airflow
  • Dagster
  • Databricks Lakeflow

Nice to have

  • AWS (S3, Kinesis, Glue, Athena)
  • Azure
  • Docker
  • Pulumi
  • workflow engines
  • Fivetran
  • Observability

What the JD emphasized

  • Data Mesh
  • HIPAA
  • PHI
  • vector-based data architectures
  • Retrieval Augmented Generation (RAG)
  • Agentic AI

Other signals

  • AI Readiness
  • LLM reporting
  • Agentic AI
  • RAG patterns
  • vector-based data architectures