Systems Engineer, Managed Operations, Managed Operations

Amazon Amazon · Big Tech · DE, Belgium +1 · Systems, Quality, & Security Engineering

This role is for a Systems Engineer within AWS Managed Operations, focusing on the European Sovereign Cloud (ESC). The primary responsibilities include building, operating, and evolving teams and services to ensure high availability of AWS services for EU customers. The role involves overseeing operations, collaborating with global teams, influencing service evolution, and ensuring continuous improvements in availability, reliability, latency, performance, and efficiency. It also includes on-call rotations and contributing to operational excellence within the Utility Computing organization.

What you'd actually do

  1. play a pivotal role in building, operating and evolving operations and development teams dedicated to delivering high-availability AWS services, including EC2, S3, Dynamo, Lambda, and Bedrock, exclusively for EU customers.
  2. overseeing the ongoing operations and expansion of the ESC, working closely with global AWS teams, and influencing the evolution of AWS services and technology.
  3. collaborating with technology leaders, contributing to the enhancement of day-to-day operations, and ensuring continuous improvements in availability, reliability, latency, performance, and efficiency of the ESC.
  4. occasionally participate in “on-call” rotations to resolve incidents occurring out-of-hours.
  5. root-cause issues and ensure your systems remain resilient and fault-tolerant, underscoring your commitment to maintaining operational excellence.

Skills

Required

  • Experience in Linux OS and network troubleshooting, or experience in networking administration and troubleshooting
  • Experience in Python, Perl, or another scripting language
  • Experience in Systems engineering, site reliability engineering, building and operating systems at scale
  • This role requires you to be a national of an EU member state
  • Able to lead the creation, revision, and/or improvement of standard operational procedures (SOPs) and driving operational best practices.

Nice to have

  • Experience with monitoring frameworks (such as CloudWatch, Datadog, Grafana, Elastic or similar).
  • Experience actively mentoring junior engineers and working cross-organizationally and leading strategic team efforts requiring work from multiple team members
  • Experience operating 24x7 high-availability, distributed software applications and performance tuning software applications and optimizing fleet utilization
  • Experience with Infrastructure as Code, (such as CDK, CloudFormation, Puppet, Chef, Ansible, or similar)
  • Experience with CI/CD pipelines, DevOps practices, and Generative AI technologies, including automated deployment, configuration management, continuous integration workflows, prompt engineering, model deployment, and AI-powered automation tools

What the JD emphasized

  • Fluency in written and spoken English is required.
  • Candidate must be a national of an EU member state and residing in the EU to operate the AWS European Sovereign Cloud.