Data Center Technician (joinoci-ns2)

Oracle Oracle · Enterprise · TX

This role is for a Data Center Technician at Oracle, focusing on maintaining and troubleshooting core data center infrastructure, including power, cooling, servers, and networking. The position involves acting as a technical liaison, monitoring systems, resolving issues, improving processes, and creating documentation. A strong understanding of data center design and operations, incident management, and systems administration is required, along with an active TS/SCI security clearance.

What you'd actually do

  1. As an Oracle Data Center Engineer, you will be the technical liaison between the technology teams and the Data Center Environment and will be key in maintaining the Operational run aspects.
  2. You will troubleshoot and solve all but the most complex infrastructure issues.
  3. You will proactively monitor the customer environment by checking system error logs, monitoring ticket queues and consulting with other groups involved in maintaining the environments.
  4. You create and maintain documentation on technologies you support.
  5. Expected to improve current processes, and introduce automation with aim towards simplification.

Skills

Required

  • Active TS/SCI security clearance w/polygraph
  • Bachelor of Science degree in Computer Science, Information Systems or equivalent work experience.
  • 3-8 years of experience, primarily in Data Center infrastructure Operations support and server administration in a mid-sized environment (200 - 1000+ server systems)
  • 3-8 years of incident management responding to electrical and mechanical failures with a focus on incident resolution and lessons learned.
  • Demonstrated expertise in two or more of these areas: Power Capacity allocation – high and low density racks, Incident management and resolution on outages, Systems administration (Linux and/or Windows Servers), Networking (DNS, TCP/IP)
  • Solid understanding of data center design, commissioning and operations best practices
  • Process improvement execution
  • Ability to dig into the details of a system or process to solve customer problems
  • Excellent oral and written communication skills
  • Strong Adherence to Process
  • Strong influencing skills
  • Ability to work independently, with little direct management
  • Experience supporting large, Enterprise customers in an Operations environment
  • Understand the design and functionality of the Data Centers within your assigned Region
  • Provide audits for power and mechanical capacity or upgrades.
  • Work with internal teams to trouble shoot problems and conduct Root Cause Analysis (RCA) and Corrective Action (CA) for design related problems.
  • Work with local colocation companies to understand and coordinate site utility requirements
  • Provide after-hours support as needed
  • Work with project teams/colocation partners to properly test and validate installation, operation, and performance of electrical/mechanical systems.
  • Support of Operations including failure mode and root cause analysis, maintenance and troubleshooting support, best practices, maintenance initiatives and operating procedure review
  • Maintain all technical documentation regarding corporate data centers, this includes procedures for the operations.
  • Work with Regional leaders and other business leaders to manage projects, optimize performance and improve the reliability and efficiency of the collocation, leased and owned data centre. infrastructure electrical and mechanical systems.
  • Participate in operational reviews to collect and analyze technical data to identify and resolve existing reliability and availability concerns.
  • Provide Subject matter Expert resource to identify and resolve resiliency, reliability and availability risks globally.
  • Oversee the Issue intake, Evaluation, and Resolution Process for the review of collocation, leased and owned data centre builds issues with focus on providing quality improvement recommendations.
  • Interface with internal data centre design teams, server hardware teams, environmental health and safety teams to promote standards that maintain consistency and reliability in services delivered
  • Be recognized as the technical expert within the group as well as within other teams.
  • Be positive and always offer creative, out of the box solutions.

Nice to have

  • experience from a reputable Cloud provider
  • experience from Tier 1 Data Center colocation providers
  • evolving their career upstream into Cloud Services

What the JD emphasized

  • Active TS/SCI security clearance w/polygraph
  • Must have demonstrated expertise in two or more of these areas: Power Capacity allocation – high and low density racks, Incident management and resolution on outages, Systems administration (Linux and/or Windows Servers), Networking (DNS, TCP/IP)
  • A solid level understanding of data center design, commissioning and operations best practices and ensures their application
  • Examples of process improvement execution
  • Examples of a drive to dig into the details of a system or process to solve customer problems
  • Strong Adherence to Process and be process champion
  • Ability to work independently, with little direct management Experience supporting large, Enterprise customers in an Operations environment
  • Provide audits for power and mechanical capacity or upgrades.
  • Work with internal teams to trouble shoot problems and conduct Root Cause Analysis (RCA) and Corrective Action (CA) for design related problems.
  • Work with project teams/colocation partners to properly test and validate installation, operation, and performance of electrical/mechanical systems.
  • Support of Operations including failure mode and root cause analysis, maintenance and troubleshooting support, best practices, maintenance initiatives and operating procedure review
  • Maintain all technical documentation regarding corporate data centers, this includes procedures for the operations.
  • Work with Regional leaders and other business leaders to manage projects, optimize performance and improve the reliability and efficiency of the collocation, leased and owned data centre. infrastructure electrical and mechanical systems.
  • Participate in operational reviews to collect and analyze technical data to identify and resolve existing reliability and availability concerns.
  • Provide Subject matter Expert resource to identify and resolve resiliency, reliability and availability risks globally.
  • Oversee the Issue intake, Evaluation, and Resolution Process for the review of collocation, leased and owned data centre builds issues with focus on providing quality improvement recommendations.
  • Interface with internal data centre design teams, server hardware teams, environmental health and safety teams to promote standards that maintain consistency and reliability in services delivered
  • Be recognized as the technical expert within the group as well as within other teams.