Data Scientist II

Caterpillar · Industrial · Peoria, IL

This role involves developing machine learning models and AI-assisted analysis tools to provide faster insights and improve decision-making within Caterpillar. The Data Scientist II will transform business questions into technical requirements, investigate data gaps, and create data products and automations. Responsibilities include developing and deploying production-level machine learning models, both supervised and unsupervised, and utilizing tools like SQL, Python, and Power BI for data analysis and visualization.

What you'd actually do

  1. Develop machine learning models and AI-assisted analysis tools to deliver faster insights and better decision-making.
  2. Transform business questions into technical requirements that can deliver flexible, sustainable data products to serve the business.
  3. Investigate and assess gaps in business processes and data assets, then recommend changes to existing data products and automations or create new ones.
  4. Develop engaging, actionable, and visually appealing reports and dashboards in Microsoft Power BI (or using other visualization techniques, including Python).

Skills

Required

  • SQL
  • data visualization
  • Python
  • Snowflake or cloud data services
  • data analytics techniques
  • automation
  • data science
  • AI
  • deploy and maintain production-level code
  • Accuracy and Attention to Detail
  • Effective Communications
  • Business Assessment
  • Research Analysis

Nice to have

  • Professional experience in a data analytics or data science role
  • Professional experience developing and deploying machine learning models (supervised and unsupervised)
  • Intermediate to expert knowledge in Python or another object-oriented programming language
  • Technical background using datasets in SQL
  • Experience creating visuals and modeling data in Power BI
  • Ability to collect required business data through various data sources and apply data analysis techniques to establish, modify, and maintain business data structures
  • Strong communication skills with the ability to explain work to business partners with limited data knowledge

What the JD emphasized

  • deploy and maintain production-level code
  • developing and deploying machine learning models (supervised and unsupervised)

Other signals

  • Develop machine learning models
  • AI-assisted analysis tools