Senior Data Scientist - Shopping Experience (search)

Instacart Instacart · Consumer · Canada · Remote · Data Science

Senior Data Scientist focused on Search experience at Instacart. This role involves owning core search metrics, designing and analyzing experiments for ranking, retrieval, and UX, and partnering with ML teams to improve relevance, quality, and latency. The position requires strong analytical skills, A/B testing experience, and the ability to translate complex data into actionable recommendations. Experience in search relevance, NLP, and bridging offline/online metrics is preferred.

What you'd actually do

  1. Own core Search metrics and funnels end to end (e.g., query → impression → engagement → cart adds), including defining guardrails, monitoring performance across platforms and segments, and diagnosing conversion gaps.
  2. Design, run, and interpret experiments across ranking, retrieval, and search UX (e.g., relevance model changes, query understanding, result layouts), turning ambiguous or conflicting outcomes into crisp, data-driven recommendations.
  3. Partner with Product, Engineering, and ML to prioritize opportunities, size impact, and influence the roadmap for relevance, quality, and latency improvements that unlock measurable business outcomes.
  4. Build deep diagnostic analyses by query class, price point, surface, and customer lifecycle to pinpoint where and why Search underperforms and specify concrete changes that will move key outcomes.
  5. Connect offline model evaluation with online and business metrics by collaborating with ML partners on evaluation design, ensuring model changes reliably improve end-user experience—not just offline scores.

Skills

Required

  • 5+ years of experience in data science or product analytics, with a track record of impact on consumer-facing products.
  • Advanced SQL proficiency, including complex joins and window functions, working with large-scale datasets in modern data warehouses (e.g., Snowflake, BigQuery, Redshift).
  • Proficiency in Python or R for analysis, experimentation, and modeling.
  • Hands-on experience designing and analyzing A/B tests end to end, including metric selection, power and sample sizing, covariate adjustment, and decision-making under uncertainty.
  • Demonstrated ability to define success metrics, decompose ambiguous product problems, and deliver clear, opinionated recommendations to Product and Engineering partners.
  • Excellent written and verbal communication skills; able to tailor complex analyses for both technical and non-technical audiences.
  • Bachelor’s degree in a quantitative field (e.g., Statistics, Computer Science, Mathematics, Economics, Engineering) or equivalent practical experience.
  • Comfort using modern AI tooling (e.g., Claude, code assistants, PromptQL) to accelerate analysis, experimentation, and communication while exercising strong judgment on quality and reliability.

Nice to have

  • Experience in search relevance, ranking, recommendations, personalization, or information retrieval (e.g., e-commerce or marketplace search).
  • Familiarity with NLP, embeddings, and semantic search, including how to evaluate and iterate on these techniques in production.
  • Experience bridging offline evaluation metrics (e.g., NDCG, precision/recall, human evaluation) with online experiments and business outcomes.
  • Background in causal inference beyond standard A/B tests (e.g., holdouts, diff-in-diff, quasi-experiments) to measure long-term or cross-surface effects.
  • Comfort working across web and native app surfaces, navigating tradeoffs between relevance, monetization, and latency.
  • Proven impact improving logging, instrumentation, and metric definitions in complex data environments.

What the JD emphasized

  • search relevance
  • ranking quality
  • latency improvements
  • model evaluation
  • search relevance
  • ranking quality
  • latency improvements
  • model evaluation

Other signals

  • search relevance
  • ranking quality
  • customer intent
  • experimentation