Advanced Data Scientist

Honeywell Honeywell · Industrial · Gdansk, Pomorskie, Poland

This role focuses on developing and integrating AI/ML models and generative AI systems, including LLMs and multi-modal embeddings, to enhance security insights and product capabilities. It involves data engineering, evaluating third-party solutions, and deploying analytical systems for forensic and real-time security outcomes.

What you'd actually do

  1. Participate in extending product capabilities through development and deployment of analytical systems leveraging data mining, generative AI, and ML techniques.
  2. Interact with 3rd parties to evaluate integration feasibility and licensing of technologies and analytic components
  3. Design, prototyping, and implementation of new data-centric solutions
  4. Effectively communicate and collaborate with local teams, international teams, and 3rd parties
  5. Contribute to build versus buy decisions

Skills

Required

  • Python with scikit-learn
  • exploratory data analysis
  • ETL operations sourcing data from SQL, REST APIs, and flat files
  • data visualization technologies, such as Power BI, Tableau, matplotlib, Excel, etc.
  • Windows and Linux environments

Nice to have

  • building generative AI applications leveraging embeddings, LLMs, VLMs, vector databases, data source APIs, and agentic patterns/frameworks
  • deployment topologies including on-premises, hybrid, and cloud for production generative AI applications
  • building predictive and decision-making AI applications
  • cloud services from AWS, Azure, or GCP
  • computer vision and image/video analysis, including object detection, recognition, tracking, and identification
  • data mining algorithms and statistical modeling techniques such as clustering, classification, regression, decision trees, neural networks, SVMs, anomaly detection, recommender systems, pattern discovery, and text mining
  • analytical thinking
  • reconciling viewpoints
  • evaluating technologies
  • effective verbal and written communication skills
  • continuous learning mindset

What the JD emphasized

  • BS or MS in an appropriate technology field (Computer Science, Statistics, Applied Math, etc.)
  • 5+ years of experience in modern advanced analytical tools and programming languages, including Python with scikit-learn
  • Proficiency with exploratory data analysis
  • Proficiency with ETL operations sourcing data from SQL, REST APIs, and flat files
  • Experience in building generative AI applications leveraging embeddings, LLMs, VLMs, vector databases, data source APIs, and agentic patterns/frameworks

Other signals

  • develop new innovative solutions
  • deploy AI-ML models
  • leverage physical monitoring system datasets
  • unlock new security insights
  • integrating systems together
  • mashing-up datasets
  • analyzing them by leveraging data mining, generative AI, and ML techniques
  • drive decision making
  • data engineering
  • leveraging closed and open-source LLMs
  • text and multi-modal embedding models
  • development of new generative AI systems and ML models
  • deliver analytical systems that improve forensic and real-time security outcomes
  • extend the capabilities of our core product ecosystems