Infrastructure Engineer/sre

Cresta Cresta · Vertical AI · APAC · Remote · Engineering

The Infrastructure Engineer/SRE role at Cresta focuses on designing, building, and advancing core infrastructure to support engineering teams. Responsibilities include developing dev tools, ensuring reliability of Kubernetes clusters, implementing metrics and logging, managing infrastructure-as-code deployments, automating operations, and building machine learning infrastructure for AI teams.

What you'd actually do

  1. Partner with engineers to build dev tools that empower developer workflows and deployment infrastructure.
  2. Ensure reliability of multi-cloud Kubernetes clusters and pipelines.
  3. Metrics, logging, analytics, and alerting for performance and security across all endpoints and applications.
  4. Infrastructure-as-code deployment tooling and supporting services on multiple cloud providers.
  5. Automate operations and engineering. Focus on automation so we can spend energy where it matters.
  6. Building machine learning infrastructure that enables AI teams to train, test, and deploy on large-scale datasets.

Skills

Required

  • Golang or Python
  • Kubernetes
  • Helm or Kustomize
  • Terraform or CloudFormation
  • AWS
  • GitOps tooling (Flux or Argo)
  • CI/CD (GitHub Actions)

Nice to have

  • GPU-enabled clusters
  • Google Cloud
  • Azure

What the JD emphasized

  • 5+ years experience in DevOps, Site Reliability Engineering, Production Engineering, or equivalent field.

Other signals

  • Building machine learning infrastructure
  • Ensure reliability of multi-cloud Kubernetes clusters and pipelines
  • Infrastructure-as-code deployment tooling