Staff Software Engineer, Systems Engineering Focus

Crusoe · Data AI · San Francisco, CA - US · Cloud Engineering

Staff Software Engineer role focused on building and scaling customer-facing managed services for Crusoe Cloud, an AI infrastructure company. The role involves designing, developing, and operating edge agents and platform services, with a strong emphasis on systems programming, Linux kernel metrics (eBPF), Kubernetes, and reliability. The engineer will provide technical leadership and collaborate across infrastructure teams to ensure the smooth delivery and operation of these critical customer-facing components.

What you'd actually do

  1. Build and scale core platform services end-to-end — from greenfield 0-to-1 projects to scaling systems handling growing production traffic.
  2. Serve as the team's subject matter expert on edge software. Review existing agent architectures, provide technical guidance on inflight designs, and shape how we build and operate software at the system level.
  3. Build and maintain lightweight, high-reliability agents deployed on customer VMs. Minimize CPU/memory footprint without sacrificing observability coverage.
  4. Instrument low-level system metrics using eBPF and procfs to power Crusoe's monitoring and telemetry pipeline.
  5. Own agent packaging and deployment via Helm charts, ensuring smooth delivery across customer environments.

Skills

Required

  • Systems Programming Expertise
  • Python
  • Go
  • Shell scripting
  • Linux Kernel Metrics
  • eBPF
  • procfs
  • Kubernetes
  • Helm charts
  • Operational Mindset
  • On-call experience
  • Reliability-First Engineering
  • Scalable Design Thinking
  • Staff-Level Impact
  • Communication

What the JD emphasized

  • customer-facing managed services
  • agent code is mission-critical — a failure here is a customer production incident
  • On-call experience on a customer-facing team is required