Senior DevOps Engineer

Upstart · Fintech · Remote · Engineering

Upstart is a lending marketplace that uses AI to reshape access to credit. The Cloud Platform team builds and operates the shared cloud infrastructure for product and machine learning workloads, owning core components across Kubernetes, AWS, service mesh, identity, and developer tooling. The Senior DevOps Engineer will evolve this platform to support increasing scale and complexity, partnering with SRE, Delivery, InfoSec, and product/ML teams to improve reliability, developer experience, and cost efficiency.

What you'd actually do

  1. Design and operate a fleet of Kubernetes (EKS) clusters across production, staging, and ephemeral environments, ensuring reliability and high availability
  2. Evolve AWS infrastructure and network architecture (VPCs, subnets, IAM, account structure) to support scalable, multi-team workloads
  3. Build and maintain infrastructure-as-code and GitOps workflows using tools such as Terraform, CDK, and ArgoCD
  4. Improve platform reliability and performance by defining and driving SLOs, analyzing incidents, and implementing systemic fixes
  5. Participate in and help improve the on-call rotation, leading incident response and post-incident reviews to drive systemic platform improvements

Skills

Required

  • Bachelor’s degree in Computer Science, Engineering, Mathematics, or a related field (or equivalent practical experience) plus 4+ years of experience
  • Experience operating Kubernetes in production environments, including cluster networking, storage, and RBAC
  • Proficiency with AWS infrastructure, including VPC design, networking, and IAM
  • Proven expertise in implementing infrastructure-as-code using tools such as Terraform or AWS CDK
  • Experience implementing GitOps workflows using tools such as ArgoCD or similar
  • Ability to influence technical decisions across teams and drive adoption of platform standards

Nice to have

  • Knowledge of service mesh technologies such as Istio or Envoy
  • Experience designing or operating multi-cluster Kubernetes architectures
  • Experience with cloud networking at scale, including ingress/egress or edge platforms (e.g., Cloudflare)
  • Knowledge of cloud security, identity, and compliance frameworks (e.g., IAM, SOC 2, CIS benchmarks)

What the JD emphasized

  • operating Kubernetes in production environments
  • proficiency with AWS infrastructure
  • implementing infrastructure-as-code
  • implementing GitOps workflows