Systems Engineer, Region Services

Amazon Amazon · Big Tech · NSW, Australia +1 · Systems, Quality, & Security Engineering

This role focuses on systems engineering within AWS Region Services, emphasizing the development and maintenance of secure, scalable cloud environments. Responsibilities include defining hardware requirements, developing operational tools, managing system health, and participating in incident response. While AI/ML are mentioned as technologies used by the team, the core craft of this specific role is not AI/ML development but rather systems engineering and operational excellence in a secure cloud context.

What you'd actually do

  1. Define and/or refine hardware requirements, participate in the development and delivery of operability-related features such as system health monitoring, diagnostics, repair, and other self-healing automation
  2. Develop or further existing application and system management tools and processes that reduce manual efforts and increase overall efficiency
  3. Adapt and improve operations management systems and processes to accommodate rapid and increasing growth in systems and traffic
  4. Participate in the design and execution of production acceptance tests and new hardware evaluations
  5. Monitor the health of the fleet, automating system health, maintenance tasks, and reporting systems as needed

Skills

Required

  • 3+ years of experience with Linux, using the command line and basic administration, and computer networking fundamentals
  • 3+ Years of Operations experience working with CI/CD Pipelines and deployment systems like; Terraform, Github Actions, Jenkins, or others
  • Able to troubleshoot at all levels, from network to operating systems to software applications
  • 3+ Years working in Linux or other UNIX based Operating Systems
  • Experience supporting cloud systems or other services. Proficient troubleshooting and anticipating problems that affect the performance, reliability, or availability of software systems

Nice to have

  • Experience operating 24x7 high-availability, distributed software applications and performance tuning software applications and optimizing fleet utilization
  • Understanding of network fundamentals (DNS, DHCP, TCP/IP, routing, load balancing, loa

What the JD emphasized

  • Australian citizens
  • Australian Government Security Clearance
  • Organisational Suitability Assessment