AI Compute Engineer

NVIDIA NVIDIA · Semiconductors · Yokneam, Israel +2

NVIDIA is seeking an AI Compute Engineer to deploy, manage, and maintain AI infrastructure for customers, focusing on large-scale AI factory projects involving networking, system design, and automation. The role requires strong Linux and data center experience, with a focus on customer interaction and support.

What you'd actually do

  1. Primary responsibilities will include deploying, managing, and maintaining AI infrastructure for new and existing customers.
  2. Be the domain expert with customers during planning calls through implementation.
  3. Handover-related documentation and perform knowledge transfers required to support customers as they begin rolling out some of the most sophisticated systems in the world!
  4. Provide feedback to internal teams such as opening bugs, documenting workarounds, and suggesting improvements.

Skills

Required

  • Bachelor’s degree in computer science, Electrical Engineering, or a related field, or equivalent experience.
  • 2+ years providing in-depth support and deployment services in data centers, solving problems for hardware and software products.
  • Knowledge and experience with Linux Operating System, administration, process management, package management, task scheduling, kernel management, boot procedures/troubleshooting, performance reporting/optimization/logging.
  • Knowledge and experience with Data center network products and protocols.
  • Knowledge and experience with Data center monitoring solutions.
  • Good interpersonal skills with the ability to maintain and deliver resolutions for customer-blocking issues as they arise.
  • Excellent verbal and written English skills.
  • Strong organizational skills and ability to prioritize/multi-task easily with limited supervision.

Nice to have

  • Kubernetes Experience.
  • Experience with AI-focused hardware(GPUs) or software(Training/Inferencing).
  • Scripting/Automation tooling background
  • Industry-standard data center certifications.
  • Prometheus/Grafana Experience.

What the JD emphasized

  • AI infrastructure
  • AI factory projects
  • AI-focused hardware(GPUs) or software(Training/Inferencing)

Other signals

  • AI factory systems
  • deploying, managing, and maintaining AI infrastructure
  • customer-focused team