Senior Software Enginer, Cape

Crusoe Crusoe · Data AI · San Francisco, CA - US · Cloud Engineering

This role is for a Senior Software Engineer focused on building and operating cloud infrastructure management systems and platforms for an AI infrastructure company. The engineer will be responsible for end-to-end use cases and workflows, contributing to systems that manage, monitor, deploy, and operate infrastructure, directly impacting business revenue. The role emphasizes reliability, scalability, operational efficiency, and ease of use in a vertically integrated AI-first environment.

What you'd actually do

  1. Collaborate extensively across teams to design and implement physical infrastructure management software systems, availability platforms, and frameworks that meet the end-to-end needs of customers hosted on our AI infrastructure.
  2. Contribute to the reliability, scalability, and security of our systems and platforms.
  3. Develop workflows that drive efficiency and meet key business objectives and metrics.
  4. Design and implement high-performing, highly available cloud architectures optimized for both performance and cost-effectiveness.
  5. Streamline cloud deployment, configuration management, and operations by developing and maintaining effective platforms, interfaces, and automation tooling.

Skills

Required

  • 5+ years of experience building and operating distributed systems at scale
  • Proven experience with building reliable, scalable, efficient, and secure cloud platforms and systems and effectively running them in production environments
  • Fluency in one or more programming languages such as Go, Rust, Java, or C++
  • A collaborative mindset when working with development and operations teams to build, maintain, and drive adoption of a robust platform
  • A solid understanding of cloud security best practices and the ability to implement secure configurations
  • Excellent troubleshooting and problem-solving skills to tackle complex infrastructure challenges
  • Strong written and verbal communication skills

Nice to have

  • Hands-on experience deploying, managing, and troubleshooting Kubernetes clusters
  • Experience working in a fast-paced startup environment

What the JD emphasized

  • scale Crusoe Cloud by 10X and beyond