AI Safety Scientist, Deep Learning

NVIDIA NVIDIA · Semiconductors · Ho Chi Minh City, Vietnam +1

Research Scientist focused on AI safety for multilingual, multimodal LLMs, including content safety, ML fairness, bias detection, and hallucination mitigation. The role involves developing datasets, moderator models, and training techniques (SFT, RL), and contributing to safety tools.

What you'd actually do

  1. Develop datasets and moderator models for evaluating LLM models and end-to-end systems for Content Safety, ML Fairness.
  2. Develop datasets for training LLM models with SFT and RL techniques, for Content Safety, ML Fairness, Security and more.
  3. Research and implement cutting-edge techniques for bias detection and mitigation in LLMs and systems.
  4. Define and track key metrics for responsible LLM behavior and usage.
  5. Contribute to our repositories and develop safety tools to help ML teams be more effective.

Skills

Required

  • Master’s or PhD in Computer Science, Electrical Engineering or related field - or have equivalent experience
  • 5+ years of work experience in developing and deploying machine learning models in production
  • Strong understanding of machine learning principles and algorithms
  • Hands-on programming experience in python
  • In-depth knowledge of machine learning frameworks, like Keras or PyTorch
  • 2+ years of work experience with one or more of the following broader areas for 2+ years: Content Safety, ML Fairness, AI Model Security, or related areas
  • Experience with one or more of the following areas within Content Safety: Hate/Harassment, Sexualized, Harmful/Violent, or other specific areas from your application
  • Experience working with large multi-lingual datasets and multi-lingual models
  • Good at problem solving and analytical ability
  • Excellent collaboration and communication skills

Nice to have

  • Experience with alignment/fine-tuning of LLMs - including regular LLMs as well as VLMs (Vision Language Model) or any-to-text
  • Prior experience with multimodal and/or multilingual Content Safety, legal and regulatory compliance
  • Experience with Hallucinations/Generative Misinformation
  • Experience with GenAI Security including Prompt Stability, Model Extraction, Confidentiality/Data Extraction, Integrity, Availability and Adversarial Robustness
  • Passion for AI and a demonstrated commitment to advancing the field through innovative research, prior scientific research and publication experience

What the JD emphasized

  • 5+ years of work experience in developing and deploying machine learning models in production
  • 2+ years of work experience with one or more of the following broader areas for 2+ years: Content Safety, ML Fairness, AI Model Security, or related areas
  • Experience with one or more of the following areas within Content Safety: Hate/Harassment, Sexualized, Harmful/Violent, or other specific areas from your application.

Other signals

  • LLM safety
  • ML fairness
  • bias detection
  • hallucinations
  • multilingual LLMs
  • multimodal LLMs