Senior Site Reliability Engineering - CTJ - Secret (Cleared Environments)

Microsoft · Big Tech · Redmond, WA +1 · Site Reliability Engineering

This role is for a Senior Site Reliability Engineer (SRE) on the Microsoft Substrate platform, which supports critical services like Exchange Online and M365 Copilot. The SRE ensures 24x7 service reliability, supports and automates deployments, and builds scalable monitoring and alerting systems. The work also includes driving compliance and security, leading post-incident learning, collaborating across teams, and staying technically current. The role requires experience with regulated, sovereign, or compliance-sensitive environments and carries specific security clearance requirements for government cloud environments.

What you'd actually do

  1. Act as a Designated Responsible Individual (DRI) in an on-call rotation, leading incident response and resolution to maintain uptime and performance for Microsoft’s most critical services.
  2. Execute and improve manual operations and deployments for our products, while designing automation to scale and streamline those processes across environments.
  3. Develop automation for monitoring, alerting, debugging, and deployment to reduce manual effort and accelerate safe, reliable delivery.
  4. Ensure systems meet Microsoft’s standards for security, privacy, and accessibility, especially when onboarding new technologies.
  5. Conduct postmortems, share insights, and implement solutions that prevent recurrence, fostering a culture of learning and continuous improvement.

Skills

Required

  • Master's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 4+ years technical experience in software engineering, network engineering, or systems administration OR equivalent experience.
  • Ability to obtain and maintain security clearances for government cloud environments (GCCM, GCCH, DoD)
  • Experience with incident response and resolution
  • Experience with automation for monitoring, alerting, and deployment
  • Experience with cloud or distributed systems

Nice to have

  • Doctorate Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration OR Master's Degree in Computer Science, Information Technology, or related field AND 6+ years technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 8+ years technical experience in software engineering, network engineering, or systems administration OR equivalent experience.
  • 3+ years technical experience working with large-scale cloud or distributed systems.

What the JD emphasized

  • regulated, sovereign, or compliance-sensitive environments
  • Microsoft Government cloud environments
  • GCC Moderate (GCCM), GCC High (GCCH), and Department of Defense (DoD) environments
  • obtain and maintain the appropriate background investigations and customer screenings
  • favorably adjudicated Tier 3 (T3) background investigation
  • Criminal Justice Information Services (CJIS) eligibility requirements
  • pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter