We are looking for a skilled and proactive DevOps Engineer to support high-scale distributed systems for a strategic engagement with Capital One. The ideal candidate will play a critical role in infrastructure management, system reliability, monitoring, automation, and incident response across cloud-based environments. This role requires operational excellence, an automation mindset, and the ability to work closely with multiple product engineering teams.
Job Responsibilities:
Monitor production systems and proactively identify performance or reliability issues
Design and implement strategies to detect, troubleshoot, and resolve system issues
Build automated solutions for operational support and incident remediation
Manage and maintain infrastructure supporting multiple product teams
Handle incident management processes, including root cause analysis and post-mortem documentation
Collaborate with product engineering teams to ensure DevOps and reliability best practices are followed
Support distributed, multi-service application environments
Conduct application performance analysis and tuning
Contribute to CI/CD pipeline improvements and release management processes
Requirements:
4–8 years of hands-on DevOps / SRE experience
Strong experience with at least one cloud platform: AWS (preferred), Azure, or GCP
Strong Linux scripting skills (Bash / sh / zsh)
Basic programming knowledge in Python (preferred), Java, or JavaScript