Senior Cloud Support Engineer

Crusoe · Data AI · Tel Aviv, IL · Cloud Go-To-Market (GTM)

This role is for a Senior Cloud Support Engineer who will be a key technical partner for customers using Crusoe's GPU-powered infrastructure. Responsibilities include delivering technical support, diagnosing and resolving issues related to VMs and hardware, managing alerts, and creating documentation. The role requires strong CLI skills, comfort navigating Linux environments, and experience with container orchestration, workload management, and observability tools, as well as familiarity with major cloud providers.

What you'd actually do

  1. Deliver hands-on, high-quality technical support through Zendesk, meeting SLAs and keeping CSAT at 95%+.
  2. Be part of a 24/7 support rotation ensuring rapid response to critical issues.
  3. Diagnose and resolve issues related to VMs, hardware failures, and scaling tests using CLI and internal tools.
  4. Manage alert triage, support maintenance windows, and run node delivery testing.
  5. Work closely with global SRE, Networking, and Storage teams from triage through to RCA (root cause analysis).

Skills

Required

  • Bachelor’s degree in IT, Computer Science, Engineering, or equivalent experience (4+ years in a similar technical role).
  • Strong CLI skills and comfort navigating Linux environments.
  • Experience using Git for collaboration and version management.
  • At least 5 years in technical support (cloud, storage, or networking).
  • Hands-on experience with container orchestration (e.g., Kubernetes), workload management (e.g., Slurm, Terraform), and observability tools (e.g., Grafana).
  • Familiarity with AWS, Azure, or GCP.
  • Clear, concise communicator with the ability to prioritize competing escalations.
  • Understanding of high-performance computing technologies like InfiniBand, Slurm, RDMA, RoCE, and SDN.

Nice to have

  • Experience with storage technologies such as NVMe, SSDs, and distributed storage systems.
  • Experience with block storage, object storage, and/or file storage. Familiarity with storage protocols like NFS, SMB, iSCSI, and NVMe-oF.
  • Certifications: CKA, CKAD, CKS, KCNA, AWS (Machine Learning, Data Analytics, Solutions Architect, Developer), NVIDIA AI, or Linux Foundation certs.
  • Experience with automation tools or scripting languages.
  • Enjoy coaching teammates and sharing knowledge.
  • Passion for making technology more efficient and sustainable.

What the JD emphasized

  • meeting SLAs
  • 24/7 support rotation
  • rapid response to critical issues
  • CLI
  • Linux environments
  • container orchestration
  • workload management
  • observability tools