This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
The Business Operations (Biz Ops) team is seeking a Business Operations Site Reliability Engineer (SRE) team Manager. The role of Business Operations Organization is to be the production readiness steward for Mastercard products. As a Business Operations SRE, we are responsible for ensuring that our platform is stable and healthy. We break down barriers to run our products by fostering developer run ownership and empowering developers to build resilient products. We support our developers during the application build phase in software run principals that includes operational design, automation, capacity planning, monitoring that leads to fault-tolerant, scalable products. We see the big picture and help create and enforce operations standards while facilitating an agile and learning culture. We support daily operations with a hyper focus on triage, root cause by understanding the business impact of our products and subsequently performing blameless post-mortems. The goal of every Business Operations team is to engage early in the development lifecycle to be more proactive and upfront in the development process, and to proactively manage production and change activities to maximize customer experience and increase the overall value of supported applications. Business Operations teams also focus on risk management by tying all our activities together with an overarching responsibility for compliance and risk mitigation across all our environments. Ultimately, the role of Business Operations is to align Product and Customer Focused priorities with Operational needs by providing continuous feedback throughout the lifecycle.
Job Responsibility:
Assist the local team to ensure a particular project is completed to enhance the platform quality
Ensures platform procedures are well documented
Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation, and refinement
Analyze ITSM activities of the platform and provide feedback loop to development teams on operational gaps or resiliency concerns
Support services before they go live through activities such as system design consulting, capacity planning and launch reviews
Maintain services once they are live by measuring and monitoring availability, latency, and overall system health
Practice sustainable incident response and blameless postmortems
Assurance of high availability for administered systems on the defined level
Work with a global team spread across tech hubs in multiple geographies and time zones
Experience in dealing with difficult situations and making decisions with a sense of urgency is needed
Interest in designing, analyzing and troubleshooting, high available and large-scale distributed systems
We need team members with an appetite for change and pushing the boundaries of what can be done with automation. Experience in working across development, operations, and product teams to prioritize needs and to build relationships
Support daily operations with a hyper focus on triage and then root cause by understanding the business implications of our products
Shift left to be more proactive and upfront in the development process
Proactively manage production and change activities to maximize customer experience, and increase the overall value of supported applications
Streamlining and standardizing traditional application specific support activities
Centralizing points of interaction for both internal and external partners by communicating effectively with all key stakeholders
Align Product and Customer Focused priorities with Operational needs
Automation of BAU tasks
Requirements:
BS degree in Computer Science or related technical field (e.g., physics or mathematics), or equivalent practical experience
Smooth English language
Strong Analytical skills
Experience in the financial industry
Systematic problem-solving approach coupled with communication skills and a sense of ownership and drive
Unix / PERL / BASH advanced
SQL PL/SQL advanced (querying and administration)
Understanding of File Transfer technologies and batch applications
Ability to interact with customers, project teams, suppliers and asking effective, forward thinking questions
Strong desire to learn with a team of professionals
Self-motivated and highly collaborative
Comfortable working in a dynamic and fast-paced environment
Nice to have:
ITIL foundation
Experience with algorithms, data structures, scripting, pipeline management, and software design
Experience in one or more of the following: C, C++, Java, Python, Go, Perl, or Ruby
CI/CD fundamentals
Familiar with monitoring tools, i.e. Splunk, Zabbix
Interest in designing, analyzing, and troubleshooting large-scale distributed systems
Experience in industry standard CI/CD tools like Git/BitBucket, Jenkins, Maven, Artifactory, and Chef
Experience in Way4 and or other OpenWay applications