Senior Site Reliability Engineer Job at Affirm

Job Description

Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest. Site Reliability Engineering at Affirm is a small, yet crucial, team that helps our Engineering partners to “Operate What They Own” with excellence to protect their customers’ experience. SRE accomplishes this through defining frameworks and best practices for operating applications, building tooling, and providing training and consulting.

Job Responsibility

You will be responsible for owning and delivering quarterly goals for your team, leading engineers on your team through ambiguity to solve open-ended problems, and ensuring that everyone is supported throughout delivery
You will support your peers and stakeholders in the product development lifecycle by collaborating with infrastructure, product management, developer experience, and analytics: participating in ideation, articulating technical constraints, and partnering on decisions that properly consider risks and trade-offs
You will proactively identify technical solutions and operational processes that strengthen incident readiness, response, and post-incident analysis
You will support the operations and availability of your team’s artifacts by creating and monitoring metrics, escalating when needed, and supporting “keep the lights on” & on-call efforts
You will foster a culture of quality and ownership on your team by setting or improving code review and design standards for your team, and advocating for them beyond your team through your writing and tech talks
You will help develop talent on your team by providing feedback and guidance, and leading by example

Requirements

4+ years of experience designing, developing and launching backend systems at scale using scripting and development languages like Bash, Python or Kotlin
A track record of developing highly available distributed systems using technologies like AWS, MySQL and Kubernetes
Meaningful experience contributing to or driving parts of the Incident Lifecycle process, enabling actionable insights that improve the quality culture, reliability, resilience, and system performance
4+ years working in a Site Reliability or Production Engineering team
Experience defining a technical plan for the delivery of a significant feature or system component with an elegant, simple and extensible design
Experience making impactful changes in a large code base, having developed a suite of tools and practices that enable you and your team to do so safely
Strong verbal and written communication skills that support effective collaboration with our global engineering team
On-Call Rotation - Participation in an on-call rotation is a requirement for this role

Nice to have

Demonstrate curiosity with empathy, and strong opinions loosely held
Your experience demonstrates that you take ownership of your growth, proactively seeking feedback from your team, your manager, and your stakeholders

What we offer

Flexible Spending Wallets for tech, food and lifestyle
Away Days - wellness days to take off work and recharge
Learning & Development programs
Parental benefit
Employee Resource & Community Groups
Health care coverage - Affirm covers all premiums for all levels of coverage for you and your dependents
Flexible Spending Wallets - generous stipends for spending on Technology, Food, various Lifestyle needs, and family forming expenses
Time off - competitive vacation and holiday schedules allowing you to take time off to rest and recharge
ESPP - An employee stock purchase plan enabling you to buy shares of Affirm at a discount
