Data Scientist - DLP

Cyera · Vertical AI · Tel Aviv, Israel · Research

Research Data Scientist focused on developing and productionizing ML/AI models for sensitive data classification and security in an enterprise AI context. The role involves end-to-end ownership of the ML lifecycle, including applying LLMs for tasks like prompt engineering, fine-tuning, RAG, and rigorous evaluation, as well as traditional ML techniques for clustering, text extraction, and tabular data classification.

What you'd actually do

  1. Own the end-to-end research process: identifying the problem, feature engineering, model development and deployment, outcome analysis, and refinement.
  2. Act as a hands-on domain leader, laying the foundations of our data science workflows and algorithms. This is an opportunity to work with very large volumes of user data and generate insights that improve our ability to classify data at scale.
  3. Develop, evaluate, and maintain machine learning and NLP solutions to enhance Cyera’s core capabilities in sensitive data classification.
  4. Innovation and creative thinking are key. Apply ML models across the entire research process: clustering, text extraction, document analysis, and tabular data classification.

Skills

Required

  • BSc in computer science, math, physics, or a related field
  • 5+ years of experience as a data scientist
  • Experience with data pipelines / big-data analytics
  • Solid grounding in core machine learning concepts and techniques
  • Demonstrated expertise in applying LLMs: prompt engineering and prompt tuning (few-shot, chain-of-thought, tool/function calling, routing), task adaptation (instruction/SFT, PEFT/LoRA, DPO/RLHF), retrieval-augmented generation, rigorous evaluation, and production deployment with appropriate safety, latency, and cost controls
  • 3+ years of hands-on programming experience in Python or a similar scripting language
  • Self-learner, initiator, able to quickly learn new technologies

Nice to have

  • MSc in computer science, math, physics, or a related field
  • Experience in NLP

What the JD emphasized

  • Experience with data pipelines / big-data analytics - a must
  • Demonstrated expertise in applying LLMs
  • Experience in NLP - a significant advantage

Other signals

  • Developing and productionizing ML/AI models for data security.
  • End-to-end ownership of ML lifecycle from problem framing to deployment.
  • Applying LLMs for sensitive data classification and insights.