CrawlJobs Logo

Senior Lead Software Engineer, DevOps (Cloud Operations Resilience Engineering)

capitalone.com Logo

Capital One

Location Icon

Location:
United States , McLean

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

209000.00 - 262400.00 USD / Year

Job Description:

Senior Lead Software Engineer, DevOps (Cloud Operations Resilience Engineering). Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, collaborative, inclusive, and iterative delivery environment? At Capital One, you'll be part of a big group of makers, breakers, doers and disruptors, who love to solve real problems and meet real customer needs. We are seeking DevOps Engineers who are passionate about marrying data with emerging technologies to join our team. As a DevOps Engineer, you’ll have the opportunity to be on the forefront of driving a major transformation within Capital One. The Cloud Operations Resilience Engineering (CORE) Technology division is responsible for enabling and evolving Capital One’s foundational cloud infrastructure layer, including observability, connectivity, resilience and availability. As a Capital One Senior Lead of Software Engineering in Cloud Runtimes, you’ll play a leading role in delivering the latest AWS/Cloud Infrastructure capabilities for use across the enterprise. You’ll bring solid experience in emerging and traditional technologies such as containers, Terraform, AWS CDK, Kubernetes, ECS, durable workflow execution, continuous reconciliation, and polyglot programming to bear in delivering platforms for infrastructure management. You’ll explore how to leverage GenAI and advanced automation techniques to reduce or eliminate infrastructure failures that impact the availability of customer-facing web and mobile applications.

Job Responsibility:

  • Work within and across Agile teams to design, develop, test, implement, and support technical solutions across full-stack development tools and technologies
  • Lead the craftsmanship, availability, resilience, and scalability of your solutions
  • Bring a passion to stay on top of tech trends, experiment with and learn new technologies, participate in internal & external technology communities, and mentor other members of the engineering community
  • Encourage innovation, implementation of cutting-edge technologies, inclusion, outside-of-the-box thinking, teamwork, self-organization, and diversity
  • Work across boundaries to improve the velocity of your and other teams
  • Lead efforts to enable and simplify the use of new and existing AWS services
  • Work with product managers to understand desired application and platform capabilities and testing scenarios

Requirements:

  • Bachelor’s degree
  • At least 6 years of experience in DevOps Engineering (Internship experience does not apply)
  • At least 4 years of experience with Cloud Native technologies (Amazon Web Services, Microsoft Azure, Google Cloud Platform)
  • At least 6 years of Unix or Linux system administration experience

Nice to have:

  • Master’s Degree
  • At least 10 years of experience in DevOps Engineering
  • At least 5 years of experience in infrastructure design, implementation and delivery
  • At least 3+ years of experience in Agile practices
  • At least 3+ years of experience with monitoring tools (OTel or Grafana)
  • AWS and CNCF certifications
What we offer:
  • performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
  • comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being

Additional Information:

Job Posted:
April 05, 2026

Employment Type:
Fulltime
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Senior Lead Software Engineer, DevOps (Cloud Operations Resilience Engineering)

Associate Head - Software Engineering

Alter Domus India develops and licenses a growing family of proprietary software...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
alterdomus.com Logo
Alter Domus
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science or a related field (or equivalent work experience)
  • Seasoned engineering senior manager with minimum 14+ years of experience managing a team and global stakeholders
  • Strong professional experience in full stack development, with a strong focus on Angular, .NET, and .NET Core
  • Very strong expertise in developing and integrating RESTful APIs, with a deep understanding of asynchronous request handling
  • Strong understanding of technology architectures, programming, databases, and cloud computing
  • Cloud platform-agnostic skills are preferred, enabling flexibility in technology selection
  • Excellent leadership, communication, and interpersonal skills to effectively manage teams and collaborate with stakeholders
  • Ability to identify problems, analyze data, and develop effective solutions that meet business needs
  • Proven experience in managing multiple projects simultaneously, overseeing implementation, and ensuring successful delivery
  • Ability to think strategically, develop long-term plans, and make decisions that align with business objectives
Job Responsibility
Job Responsibility
  • Develop and implement technology transformation strategies that align with business goals
  • Identify areas for improvement and propose innovative technologies to enhance operational efficiency
  • Design and oversee the implementation of new architectures across application, data, integration, and security domains
  • Lead the design and delivery of technology solutions that meet business needs and adhere to industry standards
  • Collaborate with cross-functional teams and clients to understand requirements and translate them into effective technical solutions
  • Evaluate and recommend new technologies, tools, and platforms to support business transformation efforts
  • Promote the culture of continuous improvement, innovation and upskilling in the team
  • Oversee the implementation of new technologies and solutions, managing project timelines and budgets to ensure successful delivery across multiple projects simultaneously
  • Continuously monitor and optimize technology performance, identifying areas for improvement and implementing strategies to enhance efficiency
  • Provide mentorship and guidance to junior engineers and team members
What we offer
What we offer
  • Support for professional accreditations such as ACCA and study leave
  • Flexible arrangements, generous holidays, birthday leave
  • Continuous mentoring along your career progression
  • Active sports, events and social committees across our offices
  • Support with mental, physical, emotional and financial support 24/7 from our Employee Assistance Program
  • The opportunity to invest in our growth and success through our Employee Share Plan
  • Plus additional local benefits depending on your location
Read More
Arrow Right

Founding Engineering Manager

Our company is seeking an experienced Software Engineering Manager to lead and e...
Location
Location
United States , Brentwood
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of detail oriented software engineering experience
  • 3+ years leading or managing software engineering teams
  • Proven record in building, scaling, or transforming engineering organizations
  • Hands-on experience owning and supporting production systems
  • Deep understanding of modern software development and operational practices (e.g., Git, CI/CD, DevOps, cloud infrastructure, software lifecycle)
  • Strong leadership, communication, and organizational capabilities
  • Experience partnering with cross-functional teams and business stakeholders
  • Ability to commute to Long Island, NY under a hybrid work model
Job Responsibility
Job Responsibility
  • Engineering Organization Leadership Establish and oversee the engineering organization, including hiring, mentoring, and managing a multi-disciplinary team
  • Cultivate a culture based on ownership, accountability, operational excellence, and continual improvement
  • Develop and refine engineering processes, organizational structure, and technical best practices
  • Align engineering objectives with business strategy in partnership with senior leaders and stakeholders
  • Production Reliability & Operational Ownership Champion the reliability and stability of production systems supporting critical operations
  • Implement and manage support processes, including incident response, on-call rotations, and root-cause analysis
  • Develop and execute operational excellence programs emphasizing monitoring, scalability, and performance optimization
  • Foster a sense of engineering accountability for production system continuity and resilience
  • Global Engineering Collaboration Drive effective coordination and communication with distributed engineering teams in a global technology environment
  • Synchronize development and operational efforts across regions, ensuring alignment and best practices
What we offer
What we offer
  • medical
  • vision
  • dental
  • life and disability insurance
  • 401(k) plan
Read More
Arrow Right

Founding Engineering Manager

Our company is seeking an experienced Software Engineering Manager to lead and e...
Location
Location
United States , Edgewood, NY
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of detail oriented software engineering experience
  • 3+ years leading or managing software engineering teams
  • Proven record in building, scaling, or transforming engineering organizations
  • Hands-on experience owning and supporting production systems
  • Deep understanding of modern software development and operational practices (e.g., Git, CI/CD, DevOps, cloud infrastructure, software lifecycle)
  • Strong leadership, communication, and organizational capabilities
  • Experience partnering with cross-functional teams and business stakeholders
  • Ability to commute to Long Island, NY under a hybrid work model
Job Responsibility
Job Responsibility
  • Engineering Organization Leadership Establish and oversee the engineering organization, including hiring, mentoring, and managing a multi-disciplinary team
  • Cultivate a culture based on ownership, accountability, operational excellence, and continual improvement
  • Develop and refine engineering processes, organizational structure, and technical best practices
  • Align engineering objectives with business strategy in partnership with senior leaders and stakeholders
  • Production Reliability & Operational Ownership Champion the reliability and stability of production systems supporting critical operations
  • Implement and manage support processes, including incident response, on-call rotations, and root-cause analysis
  • Develop and execute operational excellence programs emphasizing monitoring, scalability, and performance optimization
  • Foster a sense of engineering accountability for production system continuity and resilience
  • Global Engineering Collaboration Drive effective coordination and communication with distributed engineering teams in a global technology environment
  • Synchronize development and operational efforts across regions, ensuring alignment and best practices
What we offer
What we offer
  • medical, vision, dental, and life and disability insurance
  • eligible to enroll in our company 401(k) plan
Read More
Arrow Right

Senior Cloud Engineering Manager

Location
Location
Poland , Warszawa
Salary
Salary:
210.00 - 250.00 PLN / Hour
devire.pl Logo
Devire
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of professional experience in software engineering or infrastructure engineering, with deep expertise in public cloud environments (preferably AWS)
  • Proven leadership experience as an Engineering Manager, Technical Lead, or similar role overseeing cloud-native or platform engineering teams
  • Strong knowledge of cloud architecture principles, distributed systems design, microservices, and container orchestration technologies (e.g., Docker, Kubernetes)
  • Demonstrated experience building secure, scalable, and highly available systems within large, enterprise-scale environments
  • Experience designing and operating hybrid cloud architectures integrating public cloud with on-premise systems
  • Solid understanding of DevOps practices, CI/CD pipelines, Infrastructure as Code (e.g., Terraform, CloudFormation), and automation frameworks
  • Bachelor’s degree in Computer Science, Engineering, or a related technical discipline (or equivalent practical experience)
  • Excellent leadership, stakeholder management, and communication skills, with the ability to influence at both technical and executive levels
Job Responsibility
Job Responsibility
  • Lead, mentor, and scale a team of cloud engineers, fostering technical excellence, accountability, and continuous improvement
  • Define and execute the roadmap for reusable cloud platform capabilities, reference architectures, and standardized deployment patterns
  • Drive enterprise adoption of public cloud platforms (primarily AWS) by enabling seamless migration of legacy systems and supporting the development of cloud-native applications
  • Establish and promote a streamlined, developer-centric cloud onboarding experience to accelerate time-to-market
  • Ensure all cloud solutions are architected with a security-first and compliance-driven mindset, meeting regulatory and internal governance standards
  • Implement architectural best practices to guarantee high availability, resilience, scalability, and performance across cloud environments
  • Collaborate with cross-functional platform, security, and infrastructure teams to align on long-term cloud strategy and operating models
  • Provide executive-level visibility into cloud adoption progress, highlighting milestones, risks, and mitigation strategies
  • Engage directly with internal stakeholders to gather feedback, prioritize enhancements, and continuously improve platform capabilities
  • Oversee cloud governance processes, ensuring production deployments adhere to enterprise standards and regulatory requirements
What we offer
What we offer
  • Private HealthCare
  • Sports card
  • Life insurance
  • Working for a leading corporation with a stable market position
  • Working in the international environment
  • Hybrid work (3 days a week office work) with supportive and positive environment
  • Fulltime
Read More
Arrow Right

Director of Engineering

The Director of Engineering leads K12 Tutoring’s platform engineering organizati...
Location
Location
United States
Salary
Salary:
132000.00 - 199000.00 USD / Year
stridelearning.com Logo
Stride, Inc.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science, engineering, or related field, or equivalent practical experience
  • 10+ years of progressive experience in software engineering roles, including senior leadership of multi-team organizations
  • Proven experience leading engineering teams at scale (20+ staff and contractors) delivering complex, customer-facing platforms
  • Demonstrated success driving Agile delivery in high-growth or transformation environments
  • Strong background in modern cloud-based architectures, DevOps practices, and distributed systems
  • Experience building secure, compliant systems in regulated environments (e.g., education, healthcare, fintech, or government)
  • Track record of shipping new product capabilities—including data-driven or AI-enabled features—into production
  • Excellent people leadership skills, including coaching managers, building performance cultures, and developing technical leaders
  • Experience establishing engineering metrics and operating models that improve velocity, quality, and predictability
  • Strong strategic thinking paired with hands-on execution orientation
Job Responsibility
Job Responsibility
  • Lead and scale a distributed organization of Software Engineers, QA professionals, and Scrum Masters, fostering a culture of accountability, learning, and high performance
  • Own delivery execution across multiple Agile teams, ensuring predictable outcomes aligned to business priorities, customer commitments, and regulatory requirements
  • Partner with Product, Business, Data Engineering, AI, Information Security, and Operations leaders to shape roadmaps, scope initiatives, and deliver differentiated platform capabilities
  • Set and evolve engineering operating rhythms, including sprint planning, release management, dependency coordination, and continuous improvement practices
  • Drive architectural standards and technical strategy across the platform, ensuring solutions are secure, compliant, performant, accessible, and scalable
  • Make strategic trade-offs between short-term feature delivery and long-term technical investments, including identifying, prioritizing, and addressing technical debt to reduce risk and ensure sustainable scalability and efficiency
  • Promote adoption of engineering best practices while maintaining agility and avoiding over-engineering
  • Thrive in fast-paced, high-growth startup environments, balancing hands-on technical guidance with team leadership and execution
  • Establish and track engineering metrics such as delivery throughput, quality, reliability, security posture, and team health
  • use data to improve outcomes
What we offer
What we offer
  • health benefits
  • retirement contributions
  • paid time off
  • Fulltime
Read More
Arrow Right

Senior Staff Software Engineer- GIA Platform

GEICO is seeking an experienced software engineer with a passion for building hi...
Location
Location
United States , Palo Alto
Salary
Salary:
130000.00 - 260000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Fluency in at least one modern language (Go is preferred, .Net is a plus)
  • Proven track record of designing, implementing, and maintaining highly scalable, available and reliable system in production
  • Understanding of security best practices and data encryption technology
  • Understanding of SQL and NoSQL databases, including stateful services management and storage
  • Understanding of networking, caches, key/value stores, load balancing, global load balancing, queues, DNS and CDN
  • Deep knowledge of DevOps practices, methodologies, and principles, along with a solid understanding of on prem and public cloud-based network, compute, and storage technologies
  • In-depth knowledge of hybrid cloud architecture, IaaS and PaaS technologies, container orchestration platforms (e.g., Kubernetes), cloud efficiency and observability etc.
  • Strong background in incident management
  • Ability to create incident response playbooks, runbooks, incident triaging strategies, and post-incident analysis to drive continuous improvement in system reliability and availability
  • Experience with open-source management and monitoring tools
Job Responsibility
Job Responsibility
  • Develop and drive the overall technical roadmap for the GIA Platform organization, aligning it with the organization's business goals and objectives
  • Work closely with executive leadership, tech teams, and other cross-discipline stakeholders to build optimal strategy for delivering platform services
  • Leverage technical and domain expertise to influence partners and leadership to create a force multiplier in achieving milestones in the team’s technical roadmap
  • Provide thought leadership in GIA Platform, staying ahead of industry trends and emerging technologies to create effective strategy that minimizes business disruption while balancing the modernization of legacy platform components
  • Lead the design and architecture of resilient and scalable platform services, considering both on-premises and cloud-based solutions
  • Champion software development best practices and safe deployment processes to enable continuous, incremental delivery of business values
  • Contribute directly to and leading by example in day-to-day engineering activities (writing feature code and automated tests, raising PRs and reviewing peers’ PRs, developing and managing CI/CD pipelines, production support, among others)
  • Develop and maintain comprehensive incident response plans to address various disaster scenarios across multiple partner integration points
  • Spearhead collaboration with various stakeholders in production readiness assessment and operational excellence
  • Hands-on software engineering and SDLC best practices (Technical Review Documents, Architecture, Software Development, Code Reviews, Testing, Production Readiness Reviews, among others)
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right

Senior Software Engineer – AWS Developer

We’re looking for a Senior Software Engineer (AWS Developer) to lead the design ...
Location
Location
United States , San Diego
Salary
Salary:
Not provided
resmed.com Logo
ResMed
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of professional software development experience
  • Significant hands-on work in AWS-based production systems
  • Strong proficiency in Python with deep understanding of object-oriented design, clean code principles, and design patterns
  • Expertise with AWS services, especially serverless and cloud-native architectures, including several of: Lambda, API Gateway, DynamoDB, S3, SQS/SNS, EventBridge, CloudWatch, CloudFront, RDS/Aurora, and IAM
  • Solid experience with infrastructure-as-code (e.g., Terraform, CloudFormation, CDK) and multi-environment deployments
  • Strong grasp of RESTful API design, authentication/authorization mechanisms (OAuth2, JWT), and microservices / event-driven architectures
  • Practical experience designing and optimizing data models for both NoSQL (e.g., DynamoDB, MongoDB) and relational databases (e.g., PostgreSQL, MySQL)
  • Experience with DevOps practices: CI/CD (e.g., GitHub Actions, CodePipeline), Git workflows, Docker, and monitoring/observability tools (e.g., CloudWatch, Datadog)
  • Deep understanding of software testing strategies (unit, integration, contract, and end-to-end testing) and how to embed them into pipelines (e.g., Cypress or similar)
  • Strong communication skills, a collaborative mindset, and a track record of influencing technical direction, aligning stakeholders, and mentoring other engineers
Job Responsibility
Job Responsibility
  • Lead the design, development, testing, and operation of cloud-native software systems that are reliable, scalable, secure, and cost-effective
  • Own end-to-end architecture for services and features on AWS, making informed tradeoffs between serverless, containers, data stores, and integration patterns
  • Collaborate closely with engineers, product managers, designers, and architects to translate complex requirements into clear technical designs and implementation plans
  • Set the bar for code quality, testing, and engineering practices
  • write clean, maintainable, well-tested code and help others do the same
  • Conduct and drive code and design reviews, provide constructive feedback, and foster a culture of technical excellence and continuous improvement
  • Investigate and resolve complex production issues, performance bottlenecks, and reliability problems across multiple services and components
  • Shape and evolve our CI/CD pipelines, deployment strategies, and observability (logging, metrics, tracing, alerting) to improve developer productivity and system resilience
  • Mentor and coach associate and mid-level engineers, supporting their growth through pairing, feedback, and knowledge sharing
  • Contribute to and influence technical roadmaps, standards, and best practices for our AWS usage and overall system architecture
  • Fulltime
Read More
Arrow Right

Senior Director, IT Infrastructure, Greater China

This role is responsible to ensure the scalable, secure, and efficient IT infras...
Location
Location
China , Shanghai
Salary
Salary:
Not provided
https://www.marriott.com Logo
Marriott Bonvoy
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s in Computer Science, IT, or related field
  • Master’s preferred
  • 10+ years in IT infrastructure, with 5+ years in leadership
  • Proven expertise in cloud platforms (Alicloud/AWS/Azure), ACK/EKS and DevOps practices
  • Familiarity with WAN technologies (future-focused such as SD-WAN)
  • Experience in multi-cloud management
  • Be familiar with Agile methodology, CI/CD tools (Jenkins, GitHub, GitLab, etc.), networking (TCP/IP, DNS, BGP)
  • Be familiar with security standards (ISO 27001, NIST) and monitoring tools
  • Strong leadership, communication, and problem-solving abilities
  • Familiarity with infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation)
Job Responsibility
Job Responsibility
  • Strategic Leadership: Develop and execute the organization’s cloud & network infrastructure, DevOps roadmap in line with continent(Great China) tech strategy and global standard
  • Align infrastructure initiatives with business goals, emphasizing cost optimization and agility
  • Ensure adherence to IT policies, standards, and regulations related to cloud and network management
  • Drive innovation and continuous improvement in cloud infrastructure and DevOps processes
  • Cloud Management & Architecture: Oversee cloud operations (Alicloud, AWS) including migration, resource allocation, and cost management
  • Design resilient, scalable cloud architectures for SaaS, IaaS, and PaaS environments
  • Develop and implement disaster recovery and business continuity plans in the cloud environment
  • Monitor and analyze cloud performance metrics to identify areas for improvement and cost optimization
  • Cloud Network & Security: Optimize cloud network performance (VPCs, VPNs, CDNs) and troubleshoot connectivity
  • Implement security frameworks (Zero Trust, encryption) and ensure compliance (CBDT, GDPR, etc.)
  • Fulltime
Read More
Arrow Right