Data Engineer, Safeguards

Anthropic · AI Frontier · London, United Kingdom · Safeguards (Trust & Safety)

Data Engineer for the Safeguards team, responsible for building data pipelines, warehousing solutions, and analytical tooling to support AI safety and trust efforts. The role focuses on data infrastructure for monitoring models, preventing misuse, and ensuring user well-being.

What you'd actually do

  1. Design, build, and maintain scalable data pipelines that support safety monitoring, abuse detection, and enforcement workflows
  2. Develop and optimize data models and warehousing solutions to enable efficient analysis of large-scale usage and safety data
  3. Build and maintain dashboards and reporting infrastructure that give Safeguards teams visibility into model behavior, misuse patterns, and enforcement outcomes
  4. Collaborate with engineers to integrate data from multiple sources — including model outputs, user reports, and automated classifiers — into a unified analytical layer
  5. Implement data quality frameworks, monitoring, and alerting to ensure the reliability of safety-critical data

Skills

Required

  • SQL
  • Python
  • ETL/ELT pipeline development
  • data modeling
  • data warehousing
  • dashboarding
  • data visualization

Nice to have

  • trust & safety data systems
  • integrity data systems
  • fraud detection data systems
  • abuse detection data systems
  • large-scale event streaming systems (Kafka, Pub/Sub, Kinesis)
  • ML model monitoring data infrastructure
  • ML model evaluation data infrastructure
  • data privacy frameworks (GDPR, CCPA)
  • statistical analysis
  • collaboration with data scientists
  • internal tooling development
  • self-service analytics platforms

What the JD emphasized

  • 3+ years of experience in data engineering
  • SQL and Python
  • modern data stack tools such as dbt, Airflow, Spark
  • cloud data platforms (BigQuery, Redshift, Snowflake)
  • building dashboards and data visualizations
  • data privacy and compliance frameworks (GDPR, CCPA, or similar)

Other signals

  • data pipelines
  • data warehousing
  • analytical tooling
  • safety monitoring
  • abuse detection