Senior Machine Learning Engineer Ii, Ads Response Prediction

Instacart Instacart · Consumer · San Francisco, CA · Machine Learning

Lead research and development of pCTR and conversion prediction models, focusing on calibration, debiasing training data, and advancing accuracy. Contribute to next-generation Multi-Domain Multi-Task (MDMT) models, sequence modeling initiatives like TIGER, and Foundation Models for ads ranking. Formulate ambiguous problems into research directions with clear evaluation criteria.

What you'd actually do

  1. Lead research and development of pCTR and conversion prediction models, with a focus on improving calibration, reducing training data biases (selection bias, position bias, optimizer’s curse), and advancing model accuracy across Instacart’s ads surfaces.
  2. Design and implement debiasing techniques such as Mixed Negative Sampling (MNS), Inverse Propensity Weighting (IPW), counterfactual risk minimization, and calibration methods (Platt scaling, isotonic regression) to address systematic prediction biases.
  3. Contribute to the next-generation Multi-Domain Multi-Task (MDMT) model architecture, incorporating innovations like Mixture-of-Experts (MoE), Transformer layers for sequential user behavior, and LoRA adaptors for scalable domain fine-tuning.
  4. Drive sequence modeling initiatives including the TIGER generative retrieval system and Semantic ID representation learning, expanding their application across ads surfaces such as Product Details, Search and other placements.
  5. Collaborate with the broader ML community in the company on the path toward Foundation Models using autoregressive user behavior prediction.

Skills

Required

  • Python
  • PyTorch
  • Tensorflow
  • JAX
  • SQL
  • Spark
  • Pandas
  • Deep & Wide
  • DeepFM
  • DCN
  • Multi-task learning
  • Causal inference
  • Counterfactual reasoning
  • Selection bias
  • Position bias
  • Propensity-based correction methods

Nice to have

  • Ads ranking
  • Auction-based systems
  • pCTR
  • Bid optimization
  • ROAS feedback loops
  • Marketplace dynamics
  • Autoregressive sequence models
  • User behavior prediction
  • Generative retrieval
  • Transformer-based ranking architectures
  • Learned representations

What the JD emphasized

  • PhD/Master in machine learning, statistics, computer science, information retrieval, or a closely related quantitative field.
  • 6+ years of combined academic and industry experience (including PhD research) applying ML to ranking, recommendation, or prediction problems at scale.
  • Deep understanding of CTR/conversion prediction modeling, including familiarity with architectures such as Deep & Wide, DeepFM, DCN, and multi-task learning formulations.
  • Strong foundation in causal inference, counterfactual reasoning, and training data bias mitigation. Ability to reason about selection bias, position bias, and propensity-based correction methods.

Other signals

  • pCTR modeling
  • multi-task learning
  • sequence modeling
  • foundation models
  • retrieval systems