Software Engineer, Acceleration Kernel Development

Tenstorrent · Semiconductors · Toronto, ON · AI Software

Software Engineer focused on optimizing low-level compute kernels for AI hardware, directly impacting ML workload performance and efficiency.

What you'd actually do

  1. Write low-level code that directly powers high-efficiency machine learning workloads, optimizing every cycle, every memory move, and every instruction.
  2. Build and optimize compute kernels for parallel ML and high-performance workloads.
  3. Analyze and tune instruction-level performance across latency, memory, and bandwidth.
  4. Debug, profile, and maintain a fast, reliable low-level software stack.

Skills

Required

  • C/C++
  • high-performance code development
  • parallel algorithms
  • compute optimization
  • ML workload optimization
  • kernel development
  • performance analysis and tuning
  • debugging
  • profiling

Nice to have

  • integrating optimizations into production
  • working with ML engineers
  • tuning performance on cutting-edge architectures

What the JD emphasized

  • high-performance code
  • parallel algorithms
  • optimizing compute
  • ML workloads
  • performance
  • precision
  • compute kernels
  • ML
  • high-performance workloads
  • instruction-level performance
  • latency
  • memory
  • bandwidth
  • low-level software stack
  • AI hardware
  • kernels
  • ML frameworks
  • training pipelines
  • cutting-edge architectures
  • hardware engineers
  • lean code
  • reliable
  • scalable

Other signals

  • Optimizing ML workloads
  • Low-level code for AI hardware
  • Performance tuning