Sr Principal Data Scientist

Smartsheet Smartsheet · Seattle · India · Business Intelligence & Ops

Sr. Principal Data Scientist at Smartsheet to set technical direction for ML models and AI sub-agents across the customer lifecycle. Responsibilities include architecting systems, defining modeling and evaluation standards, shaping the roadmap for applied ML and agentic AI, and shipping high-leverage models and sub-agents. Requires deep applied ML expertise, architect-level grasp of agentic AI systems, and experience operating ML in production at scale.

What you'd actually do

  1. Set the technical direction for how Smartsheet uses applied ML and agentic AI across the customer lifecycle — what gets built, what doesn’t, and what good looks like
  2. Architect the systems behind the sub-agents: how they ground themselves in evidence, how risk and confidence are calibrated, how decisions are evaluated, and how all of it stays safe and trustworthy at scale
  3. Define the modeling, evaluation, and experimentation standards the team follows — from offline metrics to online rollout to production monitoring
  4. Personally ship the highest-leverage models and sub-agents — the ones that need a senior IC to frame the problem, push through ambiguity, and de-risk the approach
  5. Drive technical decisions on the data foundations and knowledge layer sub-agents reason over, with strong stewardship of privacy, aggregation, and customer trust

Skills

Required

  • Bachelor’s degree and 12+ years of experience (or 14+ years of experience); advanced degree in a quantitative field (Statistics, CS, ML, Economics, Operations Research, or similar) strongly preferred
  • Track record of setting technical direction for applied ML and AI work that ships and matters — major initiatives, complex systems, or new modeling paradigms taken from idea to production impact
  • Deep applied ML expertise across both traditional ML and deep learning: gradient boosting, regularized linear models, transformer-based sequence models, foundation model embeddings, causal ML, contextual bandits, and offline RL
  • Architect-level grasp of agentic AI systems: tool use, retrieval, multi-step reasoning, evaluation, guardrails, and the patterns for keeping all of it reliable in production
  • Strong grasp of causal inference for intervention design and lifecycle modeling: uplift modeling, difference-in-differences, propensity scoring, and synthetic control
  • Solid foundation in statistics and experimental design at scale: hypothesis testing, power analysis, multiple comparisons, sequential testing, and quasi-experimental methods
  • Experience operating ML in production at scale — feature engineering and pipelines, model monitoring, drift detection, retraining cadence, and the trade-offs between batch and real-time serving
  • Proficient in SQL and Python; comfort with ML/LLM tooling at scale (Spark, Databricks, Snowflake, or equivalents), ML frameworks (PyTorch, scikit-learn, XGBoost/LightGBM), and visualization tools (Tableau or similar)
  • Experience leading lifecycle modeling work — churn, expansion, adoption, plan health, lead/account scoring — and business fluency in the SaaS metrics that drive it (NRR, GRR, ARR, and cohort economics)
  • A pragmatic production bar: latency, cost, monitoring, drift, hallucination, and what happens when the model or sub-agent is wrong
  • Demonstrated ability to influence senior product and engineering leaders and to mentor staff and principal data scientists — your track record shows people and decisions, not just models, getting better
  • Comfort operating in deep ambiguity — defining the problem, choosing the approach, and aligning the team when no playbook exists

Nice to have

  • advanced degree in a quantitative field (Statistics, CS, ML, Economics, Operations Research, or similar) strongly preferred

What the JD emphasized

  • set the technical direction
  • architect the systems
  • define the modeling, evaluation, and experimentation standards
  • ship the highest-leverage models and sub-agents
  • drive technical decisions
  • partner with senior Product, Engineering, and Applied AI leaders
  • raise the bar for the data science team
  • Track record of setting technical direction for applied ML and AI work that ships and matters
  • Architect-level grasp of agentic AI systems
  • Experience operating ML in production at scale
  • A pragmatic production bar
  • Demonstrated ability to influence senior product and engineering leaders
  • Comfort operating in deep ambiguity

Other signals

  • AI agents
  • applied ML
  • customer lifecycle
  • petabyte-scale data
  • technical leadership