Data Engineer, Gtm & Product

Cognition Cognition · Coding AI · San Francisco, CA · General & Administrative

This role is for a Data Engineer at an applied AI lab building end-to-end software agents like Devin, the AI software engineer, and Windsurf, the AI-native IDE. The engineer will own the full data stack, including database architecture, pipelines, integrations, and reporting, with a focus on product and GTM reporting. The goal is to ensure data is reliable, accessible, and actionable.

What you'd actually do

  1. Design and manage database architecture and data models
  2. Build and maintain ETL/ELT pipelines and orchestration workflows
  3. Create and manage new data integrations across internal and external systems
  4. Own business reporting: datasets, dashboards, metrics, and self-serve analytics
  5. Ensure data quality, observability, governance, and documentation

Skills

Required

  • Expert SQL
  • strong Python (or R)
  • Experience with data modeling, warehouse architecture, and BI-oriented schema design
  • Hands-on experience with ETL/ELT tools (dbt, Airflow, Dagster, etc.)
  • Experience building or maintaining BI reporting (Metabase a plus)
  • Strong knowledge of statistics and experimentation

Nice to have

  • Metabase a plus

What the JD emphasized

  • 4+ years in a data engineering, data science, or full-stack data role

Other signals

  • building end-to-end software agents
  • makers of Devin, the first AI software engineer
  • AI-native IDE
  • AI that can reason on real-world tasks