Member of Technical Staff (data Scientist)

Perplexity Perplexity · AI Frontier · San Francisco, CA · Data Science

This role focuses on building AI systems to revolutionize data science workflows. The core responsibilities include developing AI agents for autonomous data analysis, making data warehouses AI-readable, automating data lifecycle management with self-healing pipelines, and creating AI-powered experiment analysis tools. The goal is to transform the data team into an AI-native product team, enabling significant leverage and industry-leading practices.

What you'd actually do

  1. Accelerate the AI-native data workflow - the team is already working this way. You'll take what's working and turn it into repeatable systems, scalable tools, and patterns that the data team and the entire company can adopt
  2. Build AI agents that do data science - not just answer SQL questions, but conduct end-to-end analyses: explore data, form hypotheses, run queries, interpret results, and generate actionable recommendations
  3. Make the warehouse AI-readable - build the semantic layer, context, and retrieval infrastructure that lets any AI system (internal or product) query Perplexity's data accurately and reliably
  4. Automate the data lifecycle - self-healing pipelines, automated dbt model generation and validation, data quality agents that detect, diagnose, and fix issues autonomously
  5. Ship AI-powered experiment analysis - agents that interpret A/B test results, flag statistical issues, and draft ship/no-ship recommendations for product teams

Skills

Required

  • 6-8+ years in data science, analytics engineering, or a related role
  • Strong product sense
  • Deep SQL expertise
  • Pipeline experience
  • Enough software engineering chops to be dangerous
  • Genuinely excited about AI
  • Builder mentality
  • Autonomy

Nice to have

  • dbt (building and maintaining production models)
  • Snowflake administration and optimization
  • Slack bots, internal CLI tools, or developer productivity tools
  • AI agent frameworks
  • BI tools
  • A/B testing and experimentation
  • Early-stage startup experience

What the JD emphasized

  • build AI systems that fundamentally change how data science gets done
  • build AI agents that conduct full analyses autonomously
  • Make the entire data warehouse AI-readable
  • create self-healing pipelines that detect and fix data issues before anyone notices
  • build the infrastructure that turns a small data team into one that operates at 10x its size
  • building with LLMs on your own time
  • building agents, RAG systems, or AI-powered workflows

Other signals

  • AI agents that conduct full analyses autonomously
  • Make the entire data warehouse AI-readable
  • Self-healing pipelines that detect and fix data issues before anyone notices
  • Build the infrastructure that turns a small data team into one that operates at 10x its size