Senior Cloud Support Engineer

Weights & Biases Weights & Biases · Data AI · Singapore · Technology - COR

Senior Cloud Support Engineer role focused on supporting AI workloads on a Kubernetes-powered HPC cloud infrastructure. Responsibilities include troubleshooting, mentoring, training, and improving support processes. Requires strong Kubernetes, Linux, and observability skills.

What you'd actually do

  1. Guide and mentor team members in developing their technical skills and troubleshooting capabilities across all disciplines supported by CoreWeave.
  2. Provide real-time feedback and coaching, reviewing tickets to identify opportunities for improvement and ensure quality assurance (QA).
  3. Develop and deliver training sessions to improve the team’s proficiency and efficiency in resolving customer issues.
  4. Use technical expertise to investigate, debug, and resolve customer-impacting issues with the curiosity required to uncover and understand root causes.
  5. Maintain high customer satisfaction through swift, accurate, and empathetic high-touch support communications, as well as established best practices.

Skills

Required

  • Kubernetes
  • Linux system administration
  • networking
  • load balancing
  • storage volumes
  • observability
  • node management
  • High-Performance Computing (HPC)
  • troubleshooting complex customer issues
  • mentoring team members
  • technical presentation skills

Nice to have

  • CKA Certified
  • Grafana

What the JD emphasized

  • Kubernetes
  • troubleshooting
  • AI workloads