Site Reliability Engineer - Infrastructure

Verkada · Enterprise · Bayoffice · Infrastructure/Platform

Verkada is seeking a Site Reliability Engineer to manage and scale their AI-powered platform infrastructure. The role involves optimizing cluster costs, enforcing security, improving monitoring, and adopting a service mesh. Responsibilities include keeping infrastructure operational, enhancing automation, defining roadmaps, and providing technical support.

What you'd actually do

  1. Keep our infrastructure up!
  2. Improve infrastructure automation
  3. Define infrastructure roadmap
  4. Provide technical support for engineers on other teams

Skills

Required

  • BS, MS, or PhD in Computer Science, or similar technical field of study
  • 1-2+ years of experience in a similar position
  • Experience in at least one scripting language (preferably Python)
  • Experience with one of the major cloud platforms (preferably AWS)
  • Experience with Kubernetes
  • Experience with Terraform

Nice to have

  • Experience with ArgoCD
  • Experience writing Kubernetes controllers
  • Experience with service mesh

What the JD emphasized

  • integrated, AI-powered platform
  • cloud physical security
  • AI-powered platform