Senior Data Scientist

Walmart Walmart · Retail · TX DALLAS

Senior Data Scientist role at Walmart focused on developing, deploying, and evaluating machine learning models and algorithms. Responsibilities include data quality checks, model development using Python and Pyspark, model deployment, and applying statistical methods and data science techniques for business problems. The role involves working with data visualization tools and presenting findings to business stakeholders.

What you'd actually do

  1. Write code to develop the required solution and application features by using the recommended programming language and leveraging business, technical, and data requirements.
  2. Apply best practice techniques for model testing and tuning to assess accuracy, fit, validity, and robustness for multi-stage models and model ensembles.
  3. Support efforts to ensure that analytical models and techniques used can be deployed into production.
  4. Perform exploratory data analysis activities on available data.
  5. Generate appropriate graphical representations of data and model outcomes.

Skills

Required

  • Experience writing production code in Python
  • Experience with database technologies and tools, including SQL
  • Experience with big data analytics using Pyspark
  • Experience developing machine learning models and algorithms
  • Experience deploying machine learning models and algorithms
  • Experience with Machine Learning (ML) frameworks: scikit learn, TensorFlow, and torch
  • Experience with statistical methods: multivariate analysis and Bayesian Statistics
  • Experience with Data Science modeling techniques: cluster analysis, Random Forest, and Boosting
  • Experience with exploratory data analysis: statistical analysis, statistical inference, and hypothesis testing
  • Experience with visualization techniques and tools: Tableau and Python libraries
  • Experience with model fit testing, tuning, and validation techniques: grid search, metrics as precision, and root mean square error

What the JD emphasized

  • Experience deploying machine learning models and algorithms
  • Experience developing machine learning models and algorithms

Other signals

  • deploying machine learning models and algorithms
  • production code in Python
  • big data analytics using Pyspark
  • statistical methods
  • Data Science modeling techniques