This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Manager, SRE Risk Advisory and Oversight at Capital One. Capital One is one of the fastest growing organizations in the world today, powered by our passion for our customers. We are serious about technology, we dream big, and we execute: Capital One moved our entire enterprise to the public cloud over the course of five years. Just as we prioritize driving innovation through technology, we equally prioritize cybersecurity, reliability, software quality, and data management.
Job Responsibility
Perform Deep-Dive Risk Analysis: Conduct independent, technical risk assessments of cloud infrastructure architectures, software delivery lifecycles, and observability frameworks to identify systemic resilience and stability risks
Support Effective Challenge: Evaluate first-line cloud engineering practices against enterprise risk appetites, ensuring robust strategies are maintained for automation, system resiliency, performance, and monitoring
Build Storytelling & Reporting Materials: Partner with team leadership (Sr. Managers and Directors) to translate complex, highly technical engineering data into structured risk reports, presentation decks, and executive storytelling materials
SRE Subject Matter Expertise: Serve as a trusted technical analyst on core SRE pillars, assessing the design and maturity of Service Level Indicators/Objectives (SLIs/SLOs), error budgets, release pipelines (CI/CD), and toil reduction efforts
Evaluate AI & Tech Integration: Actively evaluate the integration of cutting-edge technologies—specifically cloud-native stacks, containerization, and the application of emerging Gen AI/ML tooling within software delivery—to ensure reliable operational boundaries
Formulate Risk Recommendations: Collaborate across the second line of defense to design, adjust, and recommend appropriate mitigating controls and guardrails for emerging cloud tech
Stakeholder Partnership: Build and maintain collaborative relationships with first-line engineers, architects, and technical owners to ensure risk assessments are thoroughly understood and communicated transparently
Requirements
Bachelor's Degree or military experience
At least 4 years of experience in Technology Management, Software Engineering, Site Reliability Engineering, or Cyber Risk Management
At least 2 years of experience with cloud implementations (AWS, GCP, or Azure)
At least 1 year of experience with open-source programming languages
Nice to have
Master's Degree in Computer Science, Computer Engineering, or a relevant technical discipline
Professional cloud or infrastructure certification (AWS Certified Solutions Architect, AWS SysOps Administrator)
Experience analyzing or utilizing enterprise monitoring, observability, and alerting toolsets (Splunk, Prometheus, Datadog, ELK, PagerDuty)
Demonstrated understanding of cloud-native systems, containerization stacks (Kubernetes), and CI/CD pipelines
Proven experience drafting technical assessments or presentation materials used to communicate technical findings to senior leadership
Strong communication and interpersonal skills, with the ability to influence and drive technical alignment across stakeholder groups
Prior experience working within financial services or another highly-regulated industry
What we offer
performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)