Site Reliability Engineer II - CTJ - Top Secret

Microsoft · Big Tech · Redmond, WA +2 · Site Reliability Engineering

Site Reliability Engineer II for Microsoft Defender, focusing on building and delivering cloud solutions for US Government clouds. Responsibilities include live site operations, incident response, automation, compliance, and collaboration with engineering teams to ensure stability and performance of security products in highly sensitive environments.

What you'd actually do

  1. Serve as a Designated Responsible Individual (DRI) in a 24x7 on-call rotation, monitoring service health and responding to incidents within SLA timelines.
  2. Contribute to automation efforts and validate code functionality in non-production environments to ensure smooth deployments.
  3. Support compliance processes by verifying security, privacy, and accessibility standards during onboarding of new technologies.
  4. Stay current with industry trends and internal tools to improve reliability, performance, and observability at scale.
  5. Apply proven development and scaling practices to meet performance and customer requirements.

Skills

Required

  • Master's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration OR equivalent experience.
  • Hold an active TS or TS/SCI clearance and be willing and eligible to upgrade to TS/SCI (with polygraph).
  • Maintain the TS/SCI (with polygraph) clearance.
  • Meet Microsoft, customer, and/or government security screening requirements.
  • Pass the Microsoft Cloud background check.

Nice to have

  • Master's Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 5+ years technical experience in software engineering, network engineering, or systems administration OR equivalent experience.
  • 2+ years technical experience working with large-scale cloud or distributed systems.
  • Demonstrated experience applying software engineering principles to production systems, including designing, building, or improving services and platforms.
  • Proficiency in one or more programming languages such as C#, Go, Java, or Python, with the ability to develop and maintain production-quality code.
  • Experience with automation that results in measurable improvements (e.g., reduced toil, fewer manual steps, improved system reliability).
  • Experience with debugging and troubleshooting complex distributed systems in production environments.
  • Ability to independently identify problems and implement solutions that improve system reliability and operational efficiency.
  • Hands-on experience with CI/CD pipelines, testing, d

What the JD emphasized

  • highly sensitive and secure government environments
  • US Government clouds
  • TS/SCI (with polygraph)
  • maintain the TS/SCI (with polygraph) clearance
  • security screening requirements