This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
The Business Operations team is seeking a highly motivated and experienced Senior Site Reliability Engineer (SRE) to join our team. You will play a critical role in ensuring the reliability, scalability, and performance of our applications, supporting essential services that power Mastercard's global operations. As a thought leader in your field, you will bring technical expertise, a passion for automation, and the ability to mentor. The role of the Business Operations Site Reliability Engineer is to be the production readiness steward for Mastercard products. As Business Operations SRE, we are responsible for ensuring that our platform is stable and healthy. We break down barriers to running our products by fostering developer run ownership and empowering developers to build resilient products. We support our developers during the application build phase in software run principles that include operational design, automation, capacity planning, and monitoring that leads to fault-tolerant, scalable products. We see the big picture and help create and enforce operations standards while facilitating an agile and learning culture. We support daily operations with a hyper focus on triage, root cause by understanding the business impact of our products and subsequently performing blameless post-mortems. The goal of every Business Operations team is to engage early in the development lifecycle to be more proactive and upfront in the development process, and to proactively manage production and change activities to maximize customer experience and increase the overall value of supported applications. Business Operations teams also focus on risk management by tying all our activities together with an overarching responsibility for compliance and risk mitigation across all our environments. Ultimately, the role of Business Operations is to align Product and Customer Focused priorities with Operational needs by providing continuous feedback throughout the lifecycle.
Job Responsibility
Independently execute key elements of projects/processes within the Site Reliability Engineering area by applying in-depth knowledge of their discipline and area best practices to effectively resolve problems and roadblocks as they occur
Assist in evaluating operational requirements and developing technical solutions within existing frameworks
Support automation and scripting efforts to improve operational workflows and incident response processes
Troubleshoot and resolve routine and some complex system issues, escalating when necessary to maintain system health
Contribute to documentation, knowledge sharing, and best practices to enhance team operational procedures
Collaborate with development teams and stakeholders to ensure reliability solutions align with technical and business needs
Participate in reviews and quality assurance activities to uphold system stability standards
May contribute to solution development for new products/services and/or manage smaller project/initiatives as an experienced individual contributor with specialized knowledge within the Site Reliability Engineering area
Requirements
Observability
Programming and Scripting
Systems and Network Administration
Cloud Computing and Infrastructure
Reliability and Scalability
DevOps Practices
Troubleshooting
Capacity Planning and Performance Optimization
IT Service Management
Proactive Monitoring and Improvement (SRE Applications)
Corporate Security Responsibility
Abide by Mastercard’s security policies and practices
Ensure the confidentiality and integrity of the information being accessed
Report any suspected information security violation or breach
Complete all periodic mandatory security trainings in accordance with Mastercard’s guidelines