This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
As a Staff Site Reliability Engineer, you will be a technical leader and strategist within the SRE function, shaping the future of Airwallex’s reliability and infrastructure landscape. You’ll partner closely with product and engineering leadership to define the SRE roadmap, driving large-scale infrastructure projects and reliability initiatives that have global impact. You will mentor senior engineers, lead cross-team collaborations, and influence engineering culture and processes across the company. You’ll own the architecture and delivery of highly complex infrastructure systems, lead incident management at the highest level, and champion automation, observability, and operational excellence across critical services.
Job Responsibility:
Drive the strategic vision and roadmap for Site Reliability Engineering at Airwallex, aligned with business objectives and product goals
Architect and oversee the implementation of highly scalable, secure, and resilient cloud infrastructure for new services and platform-wide initiatives
Lead and mentor senior engineers and cross-functional teams in reliability engineering best practices, automation, and incident management
Champion and evolve operational excellence through advanced observability, SLO management, runbooks, and proactive risk mitigation
Lead incident response for high-severity incidents, facilitating post-mortems and driving continuous improvements
Collaborate closely with Product, Engineering, Security, and DevOps leadership to ensure compliance, resilience, and alignment across functions
Influence and shape engineering culture around reliability, scalability, and DevOps principles across multiple teams
Advocate for innovation in tooling, automation, and infrastructure to improve developer productivity and service uptime
Requirements:
10+ years of experience in SRE, DevOps, or infrastructure engineering roles, with progressive responsibility
Proven ability to lead SRE strategy and execution for large-scale, complex, cross-functional projects
Deep expertise with cloud platforms (AWS/GCP), Kubernetes, container orchestration, observability, and incident response frameworks
Strong experience supporting production systems with stringent high availability, compliance, and security requirements
Demonstrated leadership in mentoring and growing technical teams
Excellent collaboration and communication skills, able to influence stakeholders at all levels
Degree in Computer Science or related field
Nice to have:
Experience in a fintech or similarly regulated industry
Familiarity with data streaming, analytics pipelines, or financial data systems