Site Reliability Engineer

Job Description

We are looking for a Site Reliability Engineer to join our Core team and encourage infrastructure best practices across our organization, allowing us to securely scale a distributed financial platform that touches millions of people a day. Our platform tackles some of the most interesting problems in crypto for millions of our customers and continues to grow rapidly. The SRE team at Blockchain combines software and systems engineering to provide a platform that abstracts complexity for increased security, reliability and rapid product delivery. As a member of the Core team, you will be tasked with developing an in-depth understanding of the infrastructure needs of our products. You will establish and maintain creative engineering solutions that improve our customers’ experience by building the necessary tooling. Crucially, you will also guide and educate developer teams so that they can deliver new features in a rapid, secure and scalable manner.

Job Responsibilities

Play a critical role in evolving our infrastructure as we develop solutions to complex technical problems involving reliability, latency, bandwidth and, most importantly, security
Be an integral part of improving observability, monitoring and alerting throughout the platform
Help co-ordinate work across different areas of the company to ensure the most efficient path of execution
Centralize, wherever possible, common streams of work that are currently duplicated across developer teams
Focus heavily on writing tooling to replace manual, repetitive work in a scalable way
Work in a fast-paced, dynamic environment, complementing our existing high-calibre team

Requirements

Experience with containerization and service orchestration, including best practices and security
Strong knowledge of at least one programming language
Experience with Linux, including an understanding of resource allocation, networking and/or internals
Experience working with cloud solutions (GCP or AWS)
Deep understanding of, and demonstrable experience with, modern monitoring tools such as Prometheus, Datadog, Grafana and Telegraf
Experience with infrastructure as code tools
Solid background with configuration management tools
Experience using GitOps and CI to make changes, preferably with GitHub Actions
Experience with messaging systems such as Kafka
Experience with database management

Nice to have

Experience with HashiCorp Nomad, Consul and Vault is a plus
Experience with Golang, Python, and Bash is a plus
Experience with complex Terraform deployments is a plus
Experience with SaltStack is a plus
Experience working in Data Centers is a plus
Knowledge of routing and switching protocols is a plus

What we offer

Full-time salary based on experience and meaningful equity in an industry-leading company
Hybrid working model: work from home and from an awesome office in the heart of London
Unlimited vacation policy: work hard and take time when you need it
Work from Anywhere Policy: You can work remotely from anywhere in the world for up to 20 days per year
Apple equipment
The opportunity to be a key player and build your career at a rapidly expanding, global technology company in an emerging field
Flexible work culture
