Software Engineer, Research Data Platform

Anthropic Anthropic · AI Frontier · San Francisco, CA · AI Research & Engineering

Software Engineer to build and operate data pipelines and tooling for AI researchers managing data from training runs, exploring datasets, and analyzing experiments. Focus on data products supporting the research workflow.

What you'd actually do

  1. Build and operate data pipelines that extract data from research training runs and land it in storage systems that are easy and fast to query
  2. Work closely with researchers to design and build APIs, libraries, and web interfaces that support data management, exploration, and analysis
  3. Develop dataset management, data cataloging, and provenance tooling that researchers use in their day-to-day work
  4. Embed with research teams to understand their workflows, identify high-leverage tooling opportunities, and ship solutions quickly
  5. Collaborate with adjacent teams to build on existing systems rather than reinventing them

Skills

Required

  • significant software engineering experience
  • building data-intensive applications or internal tooling
  • working directly with users
  • gathering requirements iteratively
  • shipping things that get adopted
  • results-oriented
  • bias towards flexibility and impact

Nice to have

  • Large-scale ETL
  • columnar storage formats
  • query engines (e.g., Spark, BigQuery, DuckDB, Parquet)
  • High-volume time series data — ingestion, storage, and efficient querying
  • Data cataloging, lineage, or metadata management systems
  • ML experiment tracking or metrics platforms
  • Working in environments where engineers partner closely with quantitative users
  • Complex data visualization
  • full-stack web application development

Other signals

  • builds tools for researchers
  • data pipelines for training runs
  • dataset management tooling