Senior Staff Data Scientist Manager, AI Data

Google Google · Big Tech · Mountain View, CA +1

This role focuses on improving the quality of data used for training Machine Learning models, particularly Large Language Models (LLMs). The responsibilities include analyzing large datasets, defining data quality metrics, researching methods to enhance data quality, and influencing product direction through data insights. The role requires significant experience in data analysis and a background in quantitative fields.

What you'd actually do

  1. Work with large, data sets. Conduct analysis that includes data gathering and requirements specification, processing, cleaning and curation, analysis, visualization, ongoing deliverables, and presentations.
  2. Represent analysis to stakeholders and organization executives in order to share insights, influence product direction and answer difficult questions regarding data quality measurement and impact on model performance.
  3. Define key metrics that are statistically sound and meaningful to measure data quality for data in various shapes and forms, as well as to measure progress of customer engagement.
  4. Research and develop analysis and optimization methods to improve the quality of Google's ML portfolio and applications, including LLM model and training data planning.
  5. Conduct independent research and advance the state of understanding in how data impacts ultimate quality of large language models and creating spend optimization priorities with data acquisition.

Skills

Required

  • Python
  • R
  • SQL
  • data analysis
  • statistical analysis
  • querying databases

Nice to have

  • people manager
  • technical leadership

What the JD emphasized

  • high quality data is key to building better Machine Learning (ML) models
  • data quality
  • model performance
  • data quality measurement
  • data impacts ultimate quality of large language models

Other signals

  • data quality
  • ML models
  • LLM
  • data analysis
  • metrics