Member of Technical Staff (analytics Engineer)

Perplexity Perplexity · AI Frontier · San Francisco, CA · Data Science

This role focuses on building AI systems to revolutionize how data science is performed. The core responsibilities include developing AI agents for autonomous data analysis (hypothesis generation, query execution, interpretation, recommendations), making the data warehouse AI-readable, automating data lifecycle management with self-healing pipelines, and creating AI-powered experiment analysis tools. The goal is to transform the data team into a product team by building internal data products that leverage AI, enabling 10x operational efficiency.

What you'd actually do

  1. Accelerate the AI-native data workflow - the team is already working this way. You'll take what's working and turn it into repeatable systems, scalable tools, and patterns that the data team and the entire company can adopt
  2. Build AI agents that do data science - not just answer SQL questions, but conduct end-to-end analyses: explore data, form hypotheses, run queries, interpret results, and generate actionable recommendations
  3. Make the warehouse AI-readable - build the semantic layer, context, and retrieval infrastructure that lets any AI system (internal or product) query Perplexity's data accurately and reliably
  4. Automate the data lifecycle - self-healing pipelines, automated dbt model generation and validation, data quality agents that detect, diagnose, and fix issues autonomously
  5. Ship AI-powered experiment analysis - agents that interpret A/B test results, flag statistical issues, and draft ship/no-ship recommendations for product teams

Skills

Required

  • 6-8+ years in data science, analytics engineering, or a related role
  • Strong product sense
  • Deep SQL expertise
  • Pipeline experience
  • Python software engineering skills
  • Experience building with LLMs
  • Experience building agents, RAG systems, or AI-powered workflows
  • Builder mentality
  • Autonomy

Nice to have

  • dbt (building and maintaining production models)
  • Snowflake administration and optimization
  • Slack bots, internal CLI tools, or developer productivity tools
  • AI agent frameworks
  • BI tools
  • A/B testing and experimentation
  • Early-stage startup experience

What the JD emphasized

  • build AI systems that fundamentally change how data science gets done
  • build AI agents that conduct full analyses autonomously
  • Make the entire data warehouse AI-readable
  • create self-healing pipelines that detect and fix data issues before anyone notices
  • You've been building with LLMs on your own time
  • You have opinions about which models are good at what
  • You've tried building agents, RAG systems, or AI-powered workflows

Other signals

  • AI agents that conduct full analyses autonomously
  • Make the entire data warehouse AI-readable
  • Self-healing pipelines that detect and fix data issues
  • Build internal data products that stakeholders across the company actually use daily