System Development Engineer, Aws Ec2 Nitro Team

Amazon Amazon · Big Tech · Santa Clara, CA · Software Development

The System Development Engineer will join the AWS EC2 Nitro team to build and maintain the foundation of cloud computing at scale, focusing on next-generation EC2 machine learning platforms. Responsibilities include developing tools for operational health, automating CI/CD, improving code quality, and mentoring junior engineers. This role is critical for ensuring the performance, availability, scalability, and durability of AWS services.

What you'd actually do

  1. Developing tools to streamline operations as we scale to reduce operational load and improve our overall operational posture. Influencing & driving operational excellence and best practices within the organization.
  2. Identifying ways to increase the automate and improve our test infrastructure and make our CI/CD more robust and flexible
  3. Playing a key role in investigating and recommending best practices for maintaining and improving code quality, fleet health, and security & reliability of our service.
  4. Growing our talent through actively mentoring junior engineers, improving their skills, their knowledge of our systems, and their ability to get things done. Sharing your knowledge with wider teams and writing clear and concise documentation to allow other engineers to get the most out of the service and tools.

Skills

Required

  • 3+ years of programming with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby experience
  • 5+ years of non-internship professional software development experience
  • 3+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience
  • 4+ years of systems development in an IT or data center environment experience
  • 4+ years of deploying and operating in a Linux/Unix environment experience
  • 3+ years of systems design, software development, operations, automation, and process improvement experience
  • Experience leading the design, build and deployment of complex and performant (reliable and scalable) software solutions in production

Nice to have

  • 1+ years of development/programming/scripting language (Python/Java/Bash/Perl) experience
  • Knowledge of engineering practices and patterns for the full software/hardware/networks development life cycle, including coding standards, code reviews, source control management, build processes, testing, certification, and livesite operations
  • Experience taking a leading role in building complex software or computing infrastructure that has been successfully delivered to customers

What the JD emphasized

  • building the foundation of cloud computing at unprecedented scale
  • developing the next generation of EC2 machine learning platforms
  • maintaining the operational health of the service
  • maintaining build & release systems
  • ensuring maximum up-time for our developers and customers
  • experience leading the design, build and deployment of complex and performant (reliable and scalable) software solutions in production