Data Scientist

Caterpillar · Industrial · Bangalore, Karnataka

Data Scientist role at Caterpillar focused on performing analytical tasks and initiatives on large datasets to support data-driven business decisions and enable the Packaging Analytics process. Responsibilities include data gathering, processing, exploring ML techniques like NLP and text analysis, defining analysis scope, reporting insights, and conducting research on data model optimization and algorithms.

What you'd actually do

  1. Directing data gathering, data mining, and data processing processes in huge volume; creating appropriate data models.
  2. Exploring, promoting, and implementing semantic data capabilities through Natural Language Processing, text analysis and machine learning techniques.
  3. Leading to define requirements and scope of data analyses, presenting and reporting possible business insights to management using data visualization technologies.
  4. Conducting research on data model optimization and algorithms to improve effectiveness and accuracy on data analyses.

Skills

Required

  • Advanced Analytics
  • Advanced Machine Learning
  • AI
  • O365 suite (Power BI, Automate, App)
  • Predictive Modeling
  • quantitative analysis
  • statistics
  • programming
  • statistical modeling
  • core ML concepts
  • SAS
  • Python
  • SQL
  • Snowflake
  • Power-BI
  • Tableau
  • problem solving
  • Lean Mindset
  • Supply chain
  • Logistics
  • Packaging Analytics
  • IT Systems experience
  • innovation
  • leadership
  • initiative
  • judgment
  • interpersonal skills
  • communication of quantitative information

Nice to have

  • Master’s in computer science/mathematics/supply chain/Statistics

What the JD emphasized

  • Advanced Analytics, Advanced Machine Learning (& its models), AI, O365 suite (including but not limited to Power BI, Automate, App etc.) & Predictive Modeling
  • Strong quantitative analysis, statistics, programming, and statistical modeling skills.
  • Understanding of core ML concepts and its application in solving real world problems
  • High level of comfort/expertise in SAS & Python
  • Expertise in SQL, Snowflake & programming languages
  • Visualization using Power-BI/Tableau.
  • Capable problem solver who uses logic to create effective solutions to complex customer problems
  • Possess a Lean Mindset for executing continuous improvements/automations in the process which is being delivered
  • Must demonstrate superior innovation, leadership, initiative, judgment, interpersonal skills and the ability to communicate quantitative information effectively.

Other signals

  • data gathering, data mining, and data processing processes in huge volume
  • exploring, promoting, and implementing semantic data capabilities through Natural Language Processing, text analysis and machine learning techniques
  • research on data model optimization and algorithms to improve effectiveness and accuracy on data analyses