Research Scientist, AI & Systems Co-design (phd)

Meta Meta · Big Tech · Menlo Park, CA

Research Scientist focused on co-designing AI hardware and software systems for datacenter scale, optimizing models, runtime, and hardware for performance and efficiency in Generative AI and Recommendations. The role involves exploration, prototyping, and assisting in productionization, with a strong emphasis on research publications and influencing future AI hardware roadmaps.

What you'd actually do

  1. Explore, co-design and optimize parallelisms, compute efficiency, distributed training/inference paradigms and algorithms to improve the scalability, efficiency, and reliability of GenAI systems
  2. Innovate and co-design novel model deployment techniques for sustained scaling and hardware efficiency during GenAI serving
  3. Benchmark, analyze, model, and project the performance of AI workloads against a wide range of what-if scenarios and provide early input to the design of future hardware, models, and runtime, giving crucial feedback to the architecture, compiler, kernel, modeling, and runtime teams
  4. Explore, prototype and productionize highly optimized ML kernels to unlock full potential of current and future accelerators for Meta’s AI workloads
  5. Influence the hardware roadmap of Meta’s custom AI accelerators

Skills

Required

  • PhD in Computer Science, Electrical Engineering, Applied Mathematics, or a related technical field, OR a Master's degree with 3+ years of relevant industry experience
  • Proven research experience in hardware-aware model enablement, performance modeling of AI systems or prevailing accelerators/silicon architectures
  • Hands-on proficiency with end-to-end AI hardware architecture or on-device mapping algorithm development
  • Theoretical background and practical experience with AI models (e.g., CNNs, Transformers, LLMs, Diffusion models)
  • Experience in system-level performance analysis, profiling, and benchmarking of AI workloads
  • In-depth experience of Python
  • Experience with at least one major AI framework
  • Track record of publishing research papers at peer-reviewed conferences or journals
  • Experience communicating technical results to cross-functional stakeholders
  • Experience with deploying AI agents/prevalining techniques for increased efficiency
  • Experience or knowledge of training/inference of large-scale deep learning models
  • Familiarity with low-level programming for specialized hardware (e.g., CUDA, HIP, Triton) or hardware description languages (HDL)
  • Experience or knowledge of distributed ML systems and algorithm development
  • Experience or knowledge of either Generative AI models such as LLMs/LDMs or Ranking & Recommendation models such as DLRM or equivalent

Nice to have

  • Experience with AI & Systems Co-design

What the JD emphasized

  • PhD
  • proven research experience
  • track record of publishing research papers

Other signals

  • co-design of AI hardware and software
  • optimizing AI models for accelerators
  • research and productionization of AI systems