Senior Cloud Engineer

AMD AMD · Semiconductors · Austin, TX · General Management/ Administration/ Support

Senior Cloud Engineer at AMD responsible for supporting global cloud infrastructure, with a focus on the AMD Engineering AI/GPU - Compute Environment. The role involves designing, deploying, and maintaining cloud-native resources across Azure, AWS, and GCP, automating security and compliance, and collaborating with global teams to ensure operational excellence and support IT projects. While not directly building AI models, the role is critical in providing the underlying infrastructure and support for AI/GPU compute environments.

What you'd actually do

  1. Design, develop, deploy, monitor, maintain, and evolve cloud-native resources, tools, services, reusable modules (infrastructure-as-code-practices) and frameworks to secure and automate provisioning of cloud infrastructure that empowers our users across Azure, AWS, GCP.
  2. Provide customers with standards and best practices on how to deploy and consume cloud-based services.
  3. Proactively seek opportunities to improve operational efficiency of teams and usage of cloud services.
  4. Contribute to a strong team-culture and an atmosphere of cross-functional teamwork.
  5. Work with internal customers in managing incident tickets to achieve operational excellence.

Skills

Required

  • Azure
  • AWS
  • GCP
  • infrastructure-as-code
  • security controls
  • governance processes
  • compliance validation
  • identity (IAM)
  • access management
  • configuration management
  • Recovery and Continuity process
  • hybrid deployments
  • virtual machines
  • PaaS solutions
  • IaaS
  • PaaS
  • SaaS deployment

Nice to have

  • Terraform
  • YAML
  • Jenkins
  • GitHub actions
  • Python
  • Golang
  • Shell
  • Java/J2EE
  • NodeJS
  • ReactJS
  • HTML5
  • PyTorch
  • CI/CD pipeline
  • AI framework
  • GPU clusters
  • Container technologies
  • GKE
  • EKS
  • ECS
  • Docker
  • Kubernetes
  • CHANGE Management
  • Release Process
  • Agile/Scrum methodologies
  • Ansible
  • Cloud Formation
  • Deployment Mgr.
  • Resource Mgr.
  • Kubernetes/containers
  • virtualization
  • functions
  • automation
  • encryption
  • authorization
  • protocols
  • system performance monitoring
  • capacity planning
  • Cloud networking
  • VPCs
  • Load balancers
  • WAFs
  • CDNs
  • Infrastructure platforms
  • Hybrid environment
  • Cloud native monitoring tools
  • Nagios
  • ELK stack
  • Kibana
  • Prometheus

What the JD emphasized

  • security controls
  • compliance validation
  • GPU clusters