Staff Software Engineer, Cloud Infrastructure

Tenstorrent · Semiconductors · United States · Cloud Platform

This Staff Software Engineer role focuses on cloud infrastructure, SRE, and DevOps in support of AI technology development. Responsibilities include infrastructure automation, integration, operations, backend development, and IaC, using tools such as Python, Ansible, Prometheus, and Kubernetes. The engineer will use AI tools and collaborate with AI/ML experts.

What you'd actually do

  1. Do hands-on software engineering that advances infrastructure and operational excellence.
  2. Collaborate effectively with end users, peers, domain experts, and stakeholders.
  3. Provide leadership that grows teams’ capabilities and eagerness to learn.

Skills

Required

  • Python
  • Infrastructure-as-Code (Ansible)
  • shell scripting
  • Linux SysOps
  • CI/CD
  • DevOps
  • software integrations
  • operational infrastructure
  • observability
  • telemetry
  • monitoring
  • alerting
  • Prometheus
  • Loki
  • Alloy
  • Grafana
  • Sentry
  • SNMP
  • Redfish
  • IPMI
  • bare-metal provisioning
  • virtual machine provisioning
  • Kubernetes provisioning
  • bare-metal operations
  • virtual machine operations
  • Kubernetes operations

Nice to have

  • Neocloud / CSP background

What the JD emphasized

  • Fluent in Python, Infrastructure-as-Code (Ansible), shell scripting, Linux SysOps, and CI/CD.
  • Experienced in observability, including hardware-, system-, and application-level telemetry, monitoring, and alerting (Prometheus, Loki, Alloy, Grafana, Sentry, SNMP, Redfish, IPMI).
  • Familiar with bare-metal, virtual machine, and Kubernetes provisioning and operations.
  • This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology.