Site Reliability Engineering - CTJ - Poly

Microsoft · Big Tech · Redmond, WA +2 · Site Reliability Engineering

Site Reliability Engineer for Microsoft Sovereign Cloud, focusing on Windows 365 Cloud PC/Azure Virtual Desktop. Responsibilities include incident response, troubleshooting, developing automation, and ensuring service reliability, security, and performance in highly regulated environments. Requires a Bachelor's degree in Computer Science or related field, 2+ years of technical engineering experience with coding, and an active U.S. Government Top Secret Clearance with SCI and Polygraph.

What you'd actually do

  1. Responds to incidents during regular on-call rotations by identifying the level of impact, troubleshooting basic issues, taking appropriate action to mitigate impact, and deploying appropriate fixes to resolve root cause(s). Notifies product teams and owners of major customer-impacting issues and escalates complex issues, and/or those affecting multiple components or features, to other engineers as needed. Contributes details and resolutions through post-mortem reports and review meetings.
  2. Uses existing tools to troubleshoot problems or flaws affecting the availability, security, reliability, performance, and/or efficiency of components or features with guidance from other engineers. Suggests potential solutions to resolve and prevent recurring issues and brings them to the attention of other engineers or team leads.
  3. Develops an understanding of how to safely and reliably manage changes in production by using existing tools and automation, including the safe deployment process (SDP), to enable product engineering teams to implement changes across a defined range of components or features, with direction from other engineers.
  4. Develops an understanding of the code, features, and operations of specific products at scale as required to contribute to incremental improvements in product availability, security, quality, observability, reliability, efficiency, and/or performance. Participates in onboarding, code/design reviews, and regular meetings with the engineering teams that develop and/or manage those products.
  5. Supports ongoing engagements with product engineering teams by participating in code/design reviews, regular meetings, on-call rotations, and incident responses throughout product development and operations cycles. Draws insights from product engineering teams and basic analyses of telemetry data to propose potential improvements to code and designs for a defined set of product components or features with guidance from other engineers.

Skills

Required

  • Bachelor's Degree in Computer Science or related technical field
  • 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • U.S. Government Top Secret Clearance with access to Sensitive Compartmented Information (SCI) based on a Single Scope Background Investigation (SSBI) with Polygraph
  • Microsoft Cloud background check

Nice to have

  • Master's Degree in Computer Science or related technical field
  • 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python

What the JD emphasized

  • highly regulated environments
  • active U.S. Government Top Secret Clearance with access to Sensitive Compartmented Information (SCI) based on a Single Scope Background Investigation (SSBI) with Polygraph