Senior Systems Administration

AT&T · Telecom · Alpharetta, GA

Senior Systems Administration role focused on the 24x7 operations, administration, and maintenance of server infrastructure in a Linux-based Network Cloud environment. Responsibilities include software deployment, system monitoring, troubleshooting, and managing changes within the OpenStack and Kubernetes ecosystem. Requires experience with Linux, cloud platforms, scripting (Python, shell), and CI/CD pipelines.

What you'd actually do

  1. Responsible for 24x7 operations, administration, and maintenance of server infrastructure in AT&T’s Linux-based Network Cloud (NC) environment.
  2. Deploy software updates on a cloud platform composed of Linux OS, BIOS/firmware, OpenStack, Kubernetes, Calico, Ceph, MariaDB, and other software components.
  3. Perform software, maintenance, and operations work on systems.
  4. Apply troubleshooting and analytical skills to systems administration of Linux, cloud, OpenStack, and Kubernetes environments.
  5. Create, review, approve, and implement changes in the NC server environment.

Skills

Required

  • Linux
  • KVM
  • OpenStack
  • Kubernetes
  • Containers
  • Cloud Infrastructure platforms
  • Server Infrastructure monitoring
  • Troubleshooting
  • Analytical skills
  • Systems Administration
  • High Availability
  • DRS
  • Fault Tolerance
  • Scalability
  • Reliability
  • Change management
  • Application/service restoration
  • Python scripting
  • Shell scripting
  • CI/CD Pipeline
  • Jenkins
  • GitHub

What the JD emphasized

  • Requires a Bachelor's degree, or foreign equivalent degree, in Information Systems Technology, Computer Science, Computer Engineering, or Electronics Engineering and 3 years of experience in the job offered or 3 years of experience in a related occupation utilizing Linux, KVM, OpenStack, Kubernetes, containers, cloud infrastructure platforms, and monitoring of the server infrastructure; applying troubleshooting and analytical skills in systems administration of Linux, cloud, OpenStack, and Kubernetes environments; applying knowledge of operations and maintenance of OpenStack modules; applying knowledge of concepts including High Availability, DRS, Fault Tolerance, Scalability, and Reliability; creating, reviewing, approving, and implementing changes in the NC server environment; applying knowledge of restoration of applications/services in NC infrastructure from a hardware/OpenStack/OS perspective in outage/service degradation scenarios; utilizing Python and shell scripting; applying knowledge of deployment of NC server infrastructure including Linux, OpenStack, and Kubernetes; applying knowledge of CI/CD pipelines in Jenkins and GitHub; coordinating with OS vendors, development teams, and production support teams for root cause analysis and configuration changes; coordinating with hardware vendors for faulty hardware replacement.