Hardware Engineer - Liquid Cooling

Weights & Biases · Data AI · Bellevue, WA +2 · Technology

This role focuses on the design, development, and optimization of liquid cooling hardware infrastructure for AI data centers. Responsibilities include automating the hardware lifecycle, defining requirements, optimizing performance, and implementing monitoring systems for cooling components. The role requires experience with server hardware, data center thermal design, and automation tools like Ansible/Python.

What you'd actually do

  1. Develop and maintain hardware/firmware management services.
  2. Automate all aspects of the liquid cooling hardware lifecycle.
  3. Collaborate with cross-functional teams to define liquid cooling hardware requirements, specifications, and system architecture.
  4. Create and maintain accurate documentation of liquid cooling hardware designs, specifications, test procedures, and results.
  5. Analyze and optimize the performance of liquid cooling hardware systems, identify bottlenecks, and propose improvements for enhanced efficiency.

Skills

Required

  • in-rack CDUs, liquid-to-air heat exchangers, or end-of-row cooling units
  • server hardware, components, and management technologies
  • data center thermal design principles, including air and liquid cooling fundamentals
  • coolant types, thermal transfer mechanisms, and leak detection/mitigation techniques
  • Ansible/Python, and experience programmatically interacting with hardware using IPMI, SSH, or Redfish
  • collaboration with hardware and facility engineering teams
  • automation and infrastructure scalability
  • documentation skills and attention to detail
  • analytical and problem-solving abilities
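To give a sense of what "programmatically interacting with hardware" using Redfish can look like, here is a minimal Python sketch, not taken from the posting. It assumes a Redfish Thermal-style JSON payload (the kind a BMC commonly serves at a path like /redfish/v1/Chassis/<id>/Thermal) and flags temperature sensors at or above their critical threshold. The sample payload, the sensor names, and the helper name sensors_over_critical are made up for illustration; the field names Temperatures, ReadingCelsius, and UpperThresholdCritical follow the Redfish Thermal schema.

```python
import json

# Hypothetical sample payload, shaped like a Redfish Thermal resource.
# In practice this JSON would be fetched from a BMC over HTTPS; the
# sensor names and readings below are invented for illustration.
SAMPLE_THERMAL = json.loads("""
{
  "Temperatures": [
    {"Name": "Coolant Inlet",  "ReadingCelsius": 32.0, "UpperThresholdCritical": 45.0},
    {"Name": "Coolant Outlet", "ReadingCelsius": 47.5, "UpperThresholdCritical": 45.0},
    {"Name": "GPU Cold Plate", "ReadingCelsius": null, "UpperThresholdCritical": 90.0}
  ]
}
""")


def sensors_over_critical(thermal):
    """Return (name, reading, limit) for sensors at or above their critical limit."""
    flagged = []
    for sensor in thermal.get("Temperatures", []):
        reading = sensor.get("ReadingCelsius")
        limit = sensor.get("UpperThresholdCritical")
        # Skip sensors that report no reading or define no critical threshold.
        if reading is None or limit is None:
            continue
        if reading >= limit:
            flagged.append((sensor.get("Name", "unknown"), reading, limit))
    return flagged


if __name__ == "__main__":
    for name, reading, limit in sensors_over_critical(SAMPLE_THERMAL):
        print(f"{name}: {reading} C (critical at {limit} C)")
```

Keeping the check as a pure function over parsed JSON means the same logic can sit behind any transport (a direct HTTPS call, an Ansible task, or a collected log) and be tested without hardware.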

Nice to have

  • Redfish

What the JD emphasized

  • liquid cooling hardware