Machine Learning Engineer II

Abnormal AI Abnormal AI · Vertical AI · United States · Remote · Message Security Detection

Machine Learning Engineer II to join the Message Detection - Attack Detection team, focusing on building a high-recall detection engine for cybersecurity threats. The role involves designing and implementing systems that combine rules, models, and feature engineering to improve detection efficacy, working with automated retraining pipelines, and analyzing data to identify and address capability gaps. The goal is to protect customers from evolving cyber adversaries by operating detection systems at high recall and low latency.

What you'd actually do

  1. Design and implement systems that combine rules, models, feature engineering, and business and product inputs into an email detection product, with senior engineer guidance.
  2. Identify and recommend new features groups or ML model approaches that can significantly improve detection efficacy for a product. Work with infrastructure & systems engineers to productionize signals to feed into the detection system.
  3. Train models on well-defined datasets to improve model efficacy on specialized attacks
  4. Actively monitor and improve FN rates and efficacy rates for our message detection product attack categories, through feature engineering, rules and ML modeling.
  5. Analyze FN and FP datasets to categorize capability gaps and recommend short term feature and rule ideas to improve our detection efficacy.

Skills

Required

  • 3+ years experience designing, building and deploying machine learning applications in one of the domains of text understanding, entity recognition, NLP experience, computer vision, recommendation systems, or search.
  • 1+ years of experience with writing stable and production level pipelines for model training and evaluation leading to reproducible models and metrics.
  • Experience with data analytics and wielding SQL+pandas+spark framework to both build data and metric generation pipelines, and answer critical questions about system efficacy or counterfactual treatments.
  • Ability to understand business requirements thoroughly and bias toward designing a simplest yet generalizable ML model / system that can accomplish the goal.
  • Uses a systematic approach to debug both data and system issues within ML / heuristics models.
  • Fluent with Python and machine learning toolkits like numpy, sklearn, pytorch and tensorflow.
  • Effective software engineering skills who can find answers quickly from code base and writes structured, readable, well tested and efficient code.
  • BS degree in Computer Science, Applied Sciences, Information Systems or other related engineering field

Nice to have

  • MS degree in Computer Science, Electrical Engineering or other related engineering field
  • Experience with big data, statistics and Machine Learning
  • Experience with algorithms and optimization

What the JD emphasized

  • extremely high recall Detection Engine
  • milliseconds latency
  • automated model retraining pipelines
  • production level pipelines for model training and evaluation
  • critical questions about system efficacy
  • simplest yet generalizable ML model / system
  • structured, readable, well tested and efficient code

Other signals

  • The Attack Detection team plays the central role of building an extremely high recall Detection Engine that can operate on hundreds of millions of messages at milliseconds latency.
  • The team builds discriminative signals at various levels including message level (eg. presence of particular phrases), sender-level (eg.frequency of sender) and recipient level (eg.likelihood of receiving a safe message).
  • Additionally, to continuously adapt to new unseen attacks, the team builds out different stages in our automated model retraining pipelines including data analytics and generation stages, modeling stages, production evaluation stages as well as automated deployment stages.