Data Scientist II

Honeywell Honeywell · Industrial · Pittsford, NY +1

This role focuses on developing and deploying AI/ML models and generative AI systems, including LLMs and multi-modal embeddings, to enhance security insights from physical access control system datasets. It involves data engineering, integrating third-party solutions, and building analytical systems for forensic and real-time security outcomes.

What you'd actually do

  1. Participate in extending product capabilities through development and deployment of analytical systems leveraging data mining, generative AI, and ML techniques.
  2. Interact with 3rd parties to evaluate integration feasibility and licensing of technologies and analytic components
  3. Design, prototyping, and implementation of new data-centric solutions
  4. Effectively communicate and collaborate with local teams, international teams, and 3rd parties
  5. Contribute to build versus buy decisions

Skills

Required

  • BS or MS in Computer Science, Statistics, Applied Math, or related field
  • 2+ years of experience in modern advanced analytical tools and programming languages
  • Python with scikit-learn
  • Exploratory data analysis
  • ETL operations
  • SQL
  • REST APIs
  • Data visualization technologies (e.g., Power BI, Tableau, matplotlib, Excel)
  • Traditional programming languages (e.g., Python, JavaScript, TypeScript, C++, C#)
  • Windows and Linux environments

Nice to have

  • Building generative AI applications leveraging embeddings, LLMs, VLMs, vector databases, data source APIs, and agentic patterns/frameworks
  • Deployment topologies (on-premises, hybrid, cloud) for generative AI applications
  • Building predictive and decision-making AI applications
  • Cloud services (AWS, Azure, GCP)
  • Computer vision and image/video analysis (object detection, recognition, tracking, identification)
  • Data mining algorithms
  • Statistical modeling techniques (clustering, classification, regression, decision trees, neural networks, SVMs, anomaly detection, recommender systems, pattern discovery, text mining)

What the JD emphasized

  • BS or MS in an appropriate technology field (Computer Science, Statistics, Applied Math, etc.)
  • 2+ years of experience in modern advanced analytical tools and programming languages, including Python with scikit-learn
  • Some experience with exploratory data analysis
  • Some experience with ETL operations sourcing data from SQL, REST APIs, and flat files
  • Basic experience with data visualization technologies, such as Power BI, Tableau, matplotlib, Excel, etc.
  • 2+ years of experience in traditional programming languages such as Python, JavaScript, TypeScript, C++, or C#
  • Comfortable in Windows and Linux environments

Other signals

  • develop new innovative solutions
  • deploy state-of-the-art AI-ML models
  • leverage physical access control system datasets
  • integrating systems together
  • mashing-up datasets
  • leveraging state-of-the-art data mining, generative AI, and ML techniques
  • data engineering
  • leveraging closed and open-source LLMs
  • text and multi-modal embedding models
  • development of new generative AI systems and ML models
  • analytical systems that improve forensic and real-time security outcomes
  • extending product capabilities through development and deployment of analytical systems
  • build versus buy decisions