Senior Manager, SRE Risk Advisory and Oversight

Capital One · Banking · McLean, VA +2

Senior Manager role focused on SRE Risk Advisory and Oversight within a financial services company. The role involves providing technical leadership, subject matter expertise, and effective challenge over software engineering and SRE practices, with a focus on cloud services, resilience, and the integration of AI/ML tooling for optimization. Responsibilities include independent risk reviews, executive communication, and stakeholder engagement, ensuring adherence to SRE best practices and enterprise risk appetites. The role operates within the Second Line of Defense, collaborating with first-line technology teams and other risk management offices.

What you'd actually do

  1. Deliver independent, advisory-based technical leadership when assessing the design, development, and scalability of cloud-native systems against SRE best practices and enterprise risk appetites.
  2. Evaluate proposed and current cloud engineering practices, ensuring robust strategies are in place for automation, resiliency, performance, and monitoring.
  3. Act as a trusted advisor on core SRE pillars, advising teams on the maturity of their Service Level Indicators/Objectives (SLIs/SLOs), error budgeting, toil reduction, and release engineering (CI/CD) pipelines.
  4. Keep up to date with cutting-edge technology, standards, and tools, specifically cloud-native architectures, containerization, and the integration of emerging AI/ML technologies to optimize reliability and automation.
  5. Conduct independent risk reviews of cloud infrastructure, software delivery lifecycles, and observability architectures to identify systemic resilience risks.

Skills

Required

  • Bachelor’s Degree or military experience
  • At least 6 years of experience managing, consulting, auditing, or working in the fields of software engineering, site reliability engineering, or information technology
  • At least 3 years of experience with cloud implementations (AWS, GCP, Azure)
  • At least 2 years of experience with open-source programming languages

Nice to have

  • Master’s Degree in Computer Science or an Engineering discipline
  • Professional certification (AWS Cloud Practitioner, AWS Certified Solutions Architect, AWS SysOps Administrator)
  • Demonstrated understanding of cloud-native and container stacks
  • Experience with enterprise monitoring, observability, and alerting toolsets (Splunk, Prometheus, Datadog, ELK, PagerDuty)
  • Proven experience drafting analytical assessments or technical white papers for senior executives and decision-makers
  • Ability to work independently in a fast-paced environment, taking a lead advisory role on high-visibility resilience initiatives
  • Prior experience working in financial services or another highly regulated sector

What the JD emphasized

  • Second Line of Defense
  • independent
  • risk management
  • cybersecurity
  • reliability
  • resilience