TPU Kernel Engineer

Anthropic · AI Frontier · AI Research & Engineering

This role focuses on optimizing ML systems, particularly on TPUs, by designing and implementing kernels that improve performance across research, training, and inference. It involves low-level optimization and giving researchers feedback on how model changes affect performance.

What you'd actually do

  1. Identify and address performance issues across many different ML systems, including research, training, and inference.
  2. Design and optimize kernels for the TPU.
  3. Provide feedback to researchers about how model changes impact performance.
  4. Solve large-scale systems problems and low-level optimization problems.

Skills

Required

  • TPU optimization
  • kernel development
  • ML systems performance
  • low-level optimization
  • computer architecture

Nice to have

  • GPU optimization
  • ML framework internals
  • language modeling
  • transformers

What the JD emphasized

  • significant experience optimizing ML systems for TPUs, GPUs, or other accelerators
  • solving large-scale systems problems
  • low-level optimization
  • designing and implementing kernels for TPUs or other ML accelerators
  • understanding accelerators at a deep level, e.g. a background in computer architecture

Other signals

  • TPU optimization
  • kernel development
  • ML systems performance