CrawlJobs Logo

Lead Software Engineer, DevOps (Cloud Operations Resilience Engineering)

capitalone.com Logo

Capital One

Location Icon

Location:
United States , McLean

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

179400.00 - 225100.00 USD / Year

Job Description:

Lead Software Engineer, DevOps (Cloud Operations Resilience Engineering). Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, collaborative, inclusive, and iterative delivery environment? At Capital One, you'll be part of a big group of makers, breakers, doers and disruptors, who love to solve real problems and meet real customer needs. We are seeking DevOps Engineers who are passionate about marrying data with emerging technologies to join our team. As a DevOps Engineer, you’ll have the opportunity to be on the forefront of driving a major transformation within Capital One. The Cloud Operations Resilience Engineering (CORE) Technology division is responsible for enabling and evolving Capital One’s foundational cloud infrastructure layer, including observability, connectivity, resilience and availability.

Job Responsibility:

  • Lead a portfolio of diverse technology projects and a team of developers with deep experience in machine learning, distributed microservices, and full stack systems to create solutions that help meet regulatory needs for the company
  • Share your passion for staying on top of tech trends, experimenting with and learning new technologies, participating in internal & external technology communities, and mentoring other members of the engineering community
  • Collaborate with digital product managers, and deliver robust cloud-based solutions that drive powerful experiences to help millions of Americans achieve financial empowerment
  • Utilize programming languages like Java, Python, SQL, Ruby and Go, Container Orchestration services including Docker and Kubernetes, CM tools including Ansible and Terraform, and a variety of AWS tools and services

Requirements:

  • Bachelor’s degree
  • At least 4 years of experience in DevOps Engineering (Internship experience does not apply)
  • At least 3 years of experience in Cloud Native technologies (Amazon Web Services, Microsoft Azure, Google Cloud Platform)
  • At least 4 years of Unix or Linux system administration experience

Nice to have:

  • 7+ years of DevOps Engineering experience
  • 4+ years of experience with coding and scripting (Python, SQL, Java, JavaScript, Golang, Bash, Perl or Ruby)
  • 4+ years of experience with technologies Apache Mesos, Marathon, or Apache Spark
  • 4+ years of experience using build and deployment tools (Jenkins, Docker)
  • 2+ years of experience with distributed database systems (Splunk, ElasticSearch)
  • 2+ years of experience with deploying clustered web services
  • 2+ years of experience working within Agile Development Practices
What we offer:
  • performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
  • a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being

Additional Information:

Job Posted:
March 13, 2026

Employment Type:
Fulltime
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Lead Software Engineer, DevOps (Cloud Operations Resilience Engineering)

Engineering Lead Analyst

Engineering Lead Analyst position in Citi's Cloud Technology Services (CTS) team...
Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 12 Plus years of relevant experience in an Engineering role
  • Deep understanding of public cloud services adoption at scale
  • Expert-level understanding of AWS/GCP Cloud Network across Internet Application Hosting, B2B Connectivity, and Application Resiliency
  • Infrastructure as Code (IaC) Hands On Expertise with Python and Go
  • CI/CD experience with Terraform, Harness, Tekton, Jenkins, etc.
  • Testing Automation experience with Terratest, Cucumber, PytestBD, AWS Fault Injection Simulator (FIS), Chaos Mesh, etc.
  • Familiarity with Agile Development, DevOps, and SRE practices
  • Demonstrated ability to quickly learn new technologies and adapt to changing project requirements
  • Experience evaluating complex requirements and rationalizing them into consistent service offering
  • Excellent communication skills
Job Responsibility
Job Responsibility
  • Technical Expertise: hands-on technical contribution within product team focused on public cloud network
  • Collaborative Development: contribute to team of cloud engineers and full-stack software developers
  • Automation: Identify and develop automation initiatives to improve processes related to public cloud services consumption
  • Cross-Functional Partnership: collaborate with teams across Citi's technology landscape
  • Engineering Excellence: contribute to defining and measuring success criteria for service availability and reliability
  • Compliance Advocacy: ensure adherence to relevant standards, policies, and regulations
  • Serve as technology subject matter expert for internal and external stakeholders
  • Provide direction for firm mandated controls and compliance initiatives
  • Define necessary system enhancements to deploy new products and process enhancements
  • Recommend product customization for system integration
What we offer
What we offer
  • Career growth opportunities
  • Opportunity to give back to community
  • Make real impact
  • Global team environment
  • Well-being support
  • Work-life balance programs
  • Fulltime
Read More
Arrow Right

Associate Head - Software Engineering

Alter Domus India develops and licenses a growing family of proprietary software...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
alterdomus.com Logo
Alter Domus
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science or a related field (or equivalent work experience)
  • Seasoned engineering senior manager with minimum 14+ years of experience managing a team and global stakeholders
  • Strong professional experience in full stack development, with a strong focus on Angular, .NET, and .NET Core
  • Very strong expertise in developing and integrating RESTful APIs, with a deep understanding of asynchronous request handling
  • Strong understanding of technology architectures, programming, databases, and cloud computing
  • Cloud platform-agnostic skills are preferred, enabling flexibility in technology selection
  • Excellent leadership, communication, and interpersonal skills to effectively manage teams and collaborate with stakeholders
  • Ability to identify problems, analyze data, and develop effective solutions that meet business needs
  • Proven experience in managing multiple projects simultaneously, overseeing implementation, and ensuring successful delivery
  • Ability to think strategically, develop long-term plans, and make decisions that align with business objectives
Job Responsibility
Job Responsibility
  • Develop and implement technology transformation strategies that align with business goals
  • Identify areas for improvement and propose innovative technologies to enhance operational efficiency
  • Design and oversee the implementation of new architectures across application, data, integration, and security domains
  • Lead the design and delivery of technology solutions that meet business needs and adhere to industry standards
  • Collaborate with cross-functional teams and clients to understand requirements and translate them into effective technical solutions
  • Evaluate and recommend new technologies, tools, and platforms to support business transformation efforts
  • Promote the culture of continuous improvement, innovation and upskilling in the team
  • Oversee the implementation of new technologies and solutions, managing project timelines and budgets to ensure successful delivery across multiple projects simultaneously
  • Continuously monitor and optimize technology performance, identifying areas for improvement and implementing strategies to enhance efficiency
  • Provide mentorship and guidance to junior engineers and team members
What we offer
What we offer
  • Support for professional accreditations such as ACCA and study leave
  • Flexible arrangements, generous holidays, birthday leave
  • Continuous mentoring along your career progression
  • Active sports, events and social committees across our offices
  • Support with mental, physical, emotional and financial support 24/7 from our Employee Assistance Program
  • The opportunity to invest in our growth and success through our Employee Share Plan
  • Plus additional local benefits depending on your location
Read More
Arrow Right

Software Engineer Sr Staff - Platforms Developer

Designs, develops, troubleshoots and debugs software programs for software enhan...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or master’s degree in computer science, electronics, telecommunication engineering, or a related discipline
  • 14 to 19 years of experience in networking and system software development
  • Proficiency in C and C++ programming
  • Familiarity with data structures and system debugging techniques
  • Expertise in Host Complex, System Peripherals & Drivers: CPU complex (x86)
  • PCIe, SPI, I2C, MDIO
  • FPGA, CPLD, Flash Drivers
  • Expertise in Ethernet Interfaces (ranging from 1Gig to 400G+, including 800G, 1.6T), MacSec, Timing, Optics (SFP, QSFP, QDD, OSFP)
  • Expertise in High-speed packet forwarding with network processors, PHYs, and SerDes
  • Cloud Architectures
Job Responsibility
Job Responsibility
  • Collaborate with product managers, architects, and other engineers to define software requirements and specifications
  • Design, implement, and maintain networking and system software components using C and C++ programming languages
  • Conduct object-oriented analysis and design to ensure robust and scalable solutions
  • Debug complex system-level issues, leveraging your deep understanding of fundamental OS concepts (especially in Linux or similar operating systems)
  • Participate in hardware and system-level design discussions, ensuring carrier-class software development
  • Work with Linux device drivers, system bring-up, and the Linux kernel
  • Navigate large codebases effectively
  • Apply strong technical, analytical, and problem-solving skills to enhance software performance and resilience
  • Utilize scripting technologies and modern DevOps practices
  • Collaborate with cross-functional teams, including networking, embedded platform software, and hardware experts
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Site Reliability Engineer

Corporate Tools is looking for a Site Reliability Engineer. You will be a tradit...
Location
Location
United States
Salary
Salary:
175000.00 USD / Year
corporatetools.com Logo
Corporate Tools
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Software Engineering, or equivalent practical experience
  • 5+ years of experience in software engineering
  • 2+ years of experience in site reliability engineering, DevOps, or infrastructure engineering roles
  • Deep experience with cloud platforms (AWS, Azure, or GCP) and infrastructure as code tools such as Terraform, CloudFormation, or Pulumi
  • Strong proficiency with Kubernetes, Docker, and container orchestration in production environments
  • Hands-on experience with observability and monitoring tools like Prometheus, Grafana, OpenTelemetry, Sentry, or New Relic
  • Proven ability to design and implement highly available, fault-tolerant systems and lead proactive incident response efforts
  • Experience with performance tuning, database optimization, and caching strategies (e.g., PostgreSQL, Redis, Memcached)
  • Demonstrated ability to drive reliability improvements, reduce operational toil, and foster a culture of resilience and continuous improvement
  • Experience leading reliability-focused initiatives such as post-incident reviews, capacity planning, and root cause analysis
Job Responsibility
Job Responsibility
  • Stop problems before they start
  • Fix issues quickly and learn from them
  • Help keep systems steady, secure, and running
  • Work closely with DevOps engineers to build out tools and automation
  • Take ownership
What we offer
What we offer
  • 100% employer-paid medical, dental and vision for employees
  • Annual review with raise option
  • 22 days Paid Time Off accrued annually, and 4 holidays
  • After 3 years, PTO increases to 29 days
  • Employees transition to flexible time off after 5 years with the company—not accrued, not capped, take time off when you want
  • Paid Parental Leave
  • Up to 6% company matching 401(k) with no vesting period
  • Quarterly allowance
  • Open concept office with friendly coworkers
  • Creative environment where you can make a difference
  • Fulltime
Read More
Arrow Right

Founding Engineering Manager

Our company is seeking an experienced Software Engineering Manager to lead and e...
Location
Location
United States , Edgewood, NY
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of detail oriented software engineering experience
  • 3+ years leading or managing software engineering teams
  • Proven record in building, scaling, or transforming engineering organizations
  • Hands-on experience owning and supporting production systems
  • Deep understanding of modern software development and operational practices (e.g., Git, CI/CD, DevOps, cloud infrastructure, software lifecycle)
  • Strong leadership, communication, and organizational capabilities
  • Experience partnering with cross-functional teams and business stakeholders
  • Ability to commute to Long Island, NY under a hybrid work model
Job Responsibility
Job Responsibility
  • Engineering Organization Leadership Establish and oversee the engineering organization, including hiring, mentoring, and managing a multi-disciplinary team
  • Cultivate a culture based on ownership, accountability, operational excellence, and continual improvement
  • Develop and refine engineering processes, organizational structure, and technical best practices
  • Align engineering objectives with business strategy in partnership with senior leaders and stakeholders
  • Production Reliability & Operational Ownership Champion the reliability and stability of production systems supporting critical operations
  • Implement and manage support processes, including incident response, on-call rotations, and root-cause analysis
  • Develop and execute operational excellence programs emphasizing monitoring, scalability, and performance optimization
  • Foster a sense of engineering accountability for production system continuity and resilience
  • Global Engineering Collaboration Drive effective coordination and communication with distributed engineering teams in a global technology environment
  • Synchronize development and operational efforts across regions, ensuring alignment and best practices
What we offer
What we offer
  • medical, vision, dental, and life and disability insurance
  • eligible to enroll in our company 401(k) plan
Read More
Arrow Right
New

Sr sre

Location
Location
India , Putlibowli
Salary
Salary:
Not provided
https://www.randstad.com Logo
Randstad
Expiration Date
March 30, 2026
Flip Icon
Requirements
Requirements
  • Develop and maintain Infrastructure as Code (IaC) using tools like Terraform, Ansible, Dynatrace
  • Build and manage CI/CD pipelines
  • Improve infrastructure provisioning and configuration through automation
  • Monitor the health, performance, and reliability of production systems and applications
  • Design, implement, and maintain automated monitoring solutions, using tools such as Datadog
  • Define and monitor service level objectives (SLOs), service level indicators (SLIs), and error budgets
  • Implement effective alerting systems
  • Lead root cause analysis (RCA) and post-mortem investigations
  • Respond to production incidents, diagnose root causes, and implement corrective actions
  • Create and maintain playbooks and documentation for incident response
Job Responsibility
Job Responsibility
  • Develop and maintain Infrastructure as Code (IaC) using tools like Terraform, Ansible, Dynatrace to automate deployment and management of infrastructure
  • Build and manage CI/CD pipelines to ensure efficient and reliable application deployments
  • Improve infrastructure provisioning and configuration through automation, minimizing manual interventions and reducing human error
  • Monitor the health, performance, and reliability of production systems and applications
  • Design, implement, and maintain automated monitoring solutions, using tools such as Datadog
  • Define and monitor service level objectives (SLOs), service level indicators (SLIs), and error budgets to ensure system reliability and availability meet customer expectations
  • Implement effective alerting systems to identify and address potential issues before they impact users
  • Lead root cause analysis (RCA) and post-mortem investigations after incidents to identify improvements and avoid recurrence
  • Respond to production incidents, diagnose root causes, and implement corrective actions
  • Create and maintain playbooks and documentation for incident response, troubleshooting, and recovery processes
  • Fulltime
Read More
Arrow Right

Senior Cloud Engineering Manager

Location
Location
Poland , Warszawa
Salary
Salary:
210.00 - 250.00 PLN / Hour
devire.pl Logo
Devire
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of professional experience in software engineering or infrastructure engineering, with deep expertise in public cloud environments (preferably AWS)
  • Proven leadership experience as an Engineering Manager, Technical Lead, or similar role overseeing cloud-native or platform engineering teams
  • Strong knowledge of cloud architecture principles, distributed systems design, microservices, and container orchestration technologies (e.g., Docker, Kubernetes)
  • Demonstrated experience building secure, scalable, and highly available systems within large, enterprise-scale environments
  • Experience designing and operating hybrid cloud architectures integrating public cloud with on-premise systems
  • Solid understanding of DevOps practices, CI/CD pipelines, Infrastructure as Code (e.g., Terraform, CloudFormation), and automation frameworks
  • Bachelor’s degree in Computer Science, Engineering, or a related technical discipline (or equivalent practical experience)
  • Excellent leadership, stakeholder management, and communication skills, with the ability to influence at both technical and executive levels
Job Responsibility
Job Responsibility
  • Lead, mentor, and scale a team of cloud engineers, fostering technical excellence, accountability, and continuous improvement
  • Define and execute the roadmap for reusable cloud platform capabilities, reference architectures, and standardized deployment patterns
  • Drive enterprise adoption of public cloud platforms (primarily AWS) by enabling seamless migration of legacy systems and supporting the development of cloud-native applications
  • Establish and promote a streamlined, developer-centric cloud onboarding experience to accelerate time-to-market
  • Ensure all cloud solutions are architected with a security-first and compliance-driven mindset, meeting regulatory and internal governance standards
  • Implement architectural best practices to guarantee high availability, resilience, scalability, and performance across cloud environments
  • Collaborate with cross-functional platform, security, and infrastructure teams to align on long-term cloud strategy and operating models
  • Provide executive-level visibility into cloud adoption progress, highlighting milestones, risks, and mitigation strategies
  • Engage directly with internal stakeholders to gather feedback, prioritize enhancements, and continuously improve platform capabilities
  • Oversee cloud governance processes, ensuring production deployments adhere to enterprise standards and regulatory requirements
What we offer
What we offer
  • Private HealthCare
  • Sports card
  • Life insurance
  • Working for a leading corporation with a stable market position
  • Working in the international environment
  • Hybrid work (3 days a week office work) with supportive and positive environment
  • Fulltime
Read More
Arrow Right

Director of Engineering

The Director of Engineering leads K12 Tutoring’s platform engineering organizati...
Location
Location
United States
Salary
Salary:
132000.00 - 199000.00 USD / Year
stridelearning.com Logo
Stride, Inc.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science, engineering, or related field, or equivalent practical experience
  • 10+ years of progressive experience in software engineering roles, including senior leadership of multi-team organizations
  • Proven experience leading engineering teams at scale (20+ staff and contractors) delivering complex, customer-facing platforms
  • Demonstrated success driving Agile delivery in high-growth or transformation environments
  • Strong background in modern cloud-based architectures, DevOps practices, and distributed systems
  • Experience building secure, compliant systems in regulated environments (e.g., education, healthcare, fintech, or government)
  • Track record of shipping new product capabilities—including data-driven or AI-enabled features—into production
  • Excellent people leadership skills, including coaching managers, building performance cultures, and developing technical leaders
  • Experience establishing engineering metrics and operating models that improve velocity, quality, and predictability
  • Strong strategic thinking paired with hands-on execution orientation
Job Responsibility
Job Responsibility
  • Lead and scale a distributed organization of Software Engineers, QA professionals, and Scrum Masters, fostering a culture of accountability, learning, and high performance
  • Own delivery execution across multiple Agile teams, ensuring predictable outcomes aligned to business priorities, customer commitments, and regulatory requirements
  • Partner with Product, Business, Data Engineering, AI, Information Security, and Operations leaders to shape roadmaps, scope initiatives, and deliver differentiated platform capabilities
  • Set and evolve engineering operating rhythms, including sprint planning, release management, dependency coordination, and continuous improvement practices
  • Drive architectural standards and technical strategy across the platform, ensuring solutions are secure, compliant, performant, accessible, and scalable
  • Make strategic trade-offs between short-term feature delivery and long-term technical investments, including identifying, prioritizing, and addressing technical debt to reduce risk and ensure sustainable scalability and efficiency
  • Promote adoption of engineering best practices while maintaining agility and avoiding over-engineering
  • Thrive in fast-paced, high-growth startup environments, balancing hands-on technical guidance with team leadership and execution
  • Establish and track engineering metrics such as delivery throughput, quality, reliability, security posture, and team health
  • use data to improve outcomes
What we offer
What we offer
  • health benefits
  • retirement contributions
  • paid time off
  • Fulltime
Read More
Arrow Right