Applied Ai/ml Scientist

Cerebras Cerebras · Semiconductors · United Arab Emirates · Software

Applied AI Scientist role focused on developing and customizing large language and deep learning models for customer problems using Cerebras' AI chip. Responsibilities include customer use case discovery, architecting and executing end-to-end training recipes, fine-tuning models, and building components for agentic systems. Requires expertise in large model training, adaptation strategies (pre-training, SFT, RLHF/DPO), and distributed training.

What you'd actually do

  1. Architect and execute end-to-end training recipes for custom models, tailoring model architecture and training recipes to meet customer-specific performance and accuracy requirements.
  2. Design and implement sophisticated adaptation strategies, including continuous pre-training on private datasets, supervised fine-tuning (SFT), and post-training alignment via RLHF or DPO.
  3. Build and optimize the core components of agentic systems, focusing on tool-use capabilities, long-context reasoning, and multi-step planning.
  4. Collaborate with customer stakeholders to identify the best approaches to their business problem with AI.
  5. Define project milestones, success metrics, and rigorous evaluation benchmarks to ensure the solution delivers measurable value to the customer’s business.

Skills

Required

  • Python
  • PyTorch
  • distributed training frameworks
  • large-scale distributed data processing pipelines
  • Master’s or PhD in Computer Science, Machine Learning, or related fields
  • modern model architectures
  • scaling laws
  • training dynamics

Nice to have

  • MoEs
  • multimodal
  • sequence models
  • RLHF
  • DPO

What the JD emphasized

  • implementing, training, and scaling models
  • large model training
  • fine-tuning
  • training dynamics
  • model convergence
  • large-scale model training
  • distributed training

Other signals

  • customer use cases
  • custom SOTA models
  • large model training
  • agentic systems