Junior Site Reliability Engineer

accesso

Location:
United Kingdom

Contract Type:
Not provided

Salary:

Not provided

Job Description:

As a Jr. Site Reliability Engineer, you will 'make things scale', which includes supporting the delivery and operation of the managed accesso Horizon product in customers’ cloud environments (AWS/Azure/GCP). You will work under mentor guidance to deploy, operate and support customer environments, automate tasks, and learn site reliability and cloud best practices.

Job Responsibility:

  • Assist with provisioning and deploying accesso Horizon components to customer cloud accounts using Infrastructure as Code (Terraform)
  • Help maintain CI/CD pipelines (GitHub Actions) for application and infrastructure deployments
  • Support monitoring, logging and alerting (Prometheus, Grafana & Coralogix) and respond to basic alerts with supervision
  • Implement and improve basic automation and scripting
  • Participate in incident triage, root cause investigation and follow-up tasks
  • Follow security and compliance requirements for customer cloud environments (identity, secrets, network controls)
  • Produce and maintain operational runbooks, deployment guides and change notes
  • Participate in the on-call rotation as an L1 responder
  • The role may require some work outside normal working hours
  • Learn and apply accesso Horizon product architecture and configuration

Requirements:

  • Some practical exposure to cloud platforms (AWS/Azure/GCP) through coursework, internships, or self-led projects
  • Ability to self-learn with assistance from Senior Engineers
  • Basic scripting ability using Python or Bash
  • Familiarity with basic Linux systems and the general command line
  • Understanding of Git and basic CI/CD concepts
  • Good written and verbal communication
  • Customer-focused approach
  • Ability to work with minimal direction
  • Willingness to learn, take direction and work within a team

Nice to have:

  • Experience with Terraform, Docker, Kubernetes (EKS/AKS/GKE) or monitoring tools
  • Familiarity with security fundamentals (IAM, network ACLs, secrets management)
  • Experience supporting a SaaS or managed service

What we offer:

  • Competitive compensation package including an annual bonus opportunity
  • 8 days of paid bank holiday leave and 26 days of paid annual leave (paid leave increases with tenure)
  • 8 hours of paid Volunteer Time Off
  • Inclusive Family Benefits, including a $7,500 benefit for surrogacy, adoption, and fertility
  • Robust health insurance scheme with the opportunity to participate in a private medical scheme after satisfactory performance
  • Matching pension scheme (up to 8%)
  • Unlimited access to Udemy for Business
  • Flexible work schedule

Additional Information:

Job Posted:
December 05, 2025

Employment Type:
Fulltime
Work Type:
Remote work

Similar Jobs for Junior Site Reliability Engineer

Lead Site Reliability Engineer

Groupon is a marketplace where customers discover new experiences and services e...
Location:
India, Bangalore
Salary:
Not provided
Groupon
Expiration Date
Until further notice
Requirements
  • 10+ years in systems engineering
  • at least 5 years in SRE or DevOps roles
  • expertise in cloud platforms (GCP, AWS) and container orchestration (Kubernetes, Docker)
  • proficiency in programming and scripting languages like Python, Go, and Bash
  • advanced knowledge of Infrastructure as Code (IaC) tools such as Terraform and Ansible
  • deep understanding of networking, DNS, load balancing, and security principles
  • proven track record of managing high-availability systems in demanding environments
  • exceptional analytical and problem-solving skills
Job Responsibility
  • Architect and maintain fault-tolerant systems, ensuring uptime SLAs of 99.9% or higher
  • drive automation in infrastructure management and deployment using Terraform, Ansible, Kubernetes, and similar tools
  • create and optimize CI/CD pipelines to ensure reliable, secure, and efficient software delivery
  • build and enhance comprehensive observability solutions, including monitoring, logging, and alerting systems using Prometheus, Grafana, and the ELK stack
  • collaborate with stakeholders to define and achieve SLIs, SLOs, and error budgets aligned with business needs
  • lead incident response during on-call rotations, ensuring rapid resolution and root cause analysis for critical issues
  • design and execute performance testing, capacity planning, and scalability strategies for evolving workloads
  • proactively identify and resolve bottlenecks, increasing system performance and developer efficiency
  • mentor junior engineers, fostering a collaborative and growth-oriented team environment
  • guide architectural decisions that drive innovation and enhance system reliability
What we offer
  • The opportunity to work with cutting-edge technologies in a transformative environment
  • a collaborative and innovative work environment that values your expertise and contributions
  • professional growth and leadership development pathways tailored to your aspirations
  • a chance to leave a lasting impact by shaping the future of reliable and scalable systems

Staff Engineer, Site Reliability

LearnUpon is looking for a Staff Site Reliability Engineer to join our team in I...
Location:
Ireland, Dublin
Salary:
Not provided
LearnUpon
Expiration Date
Until further notice
Requirements
  • 7+ years of experience in a software or Ops role
  • 5+ years of cloud engineering experience, with at least 2 years experience with AWS
  • Experience deploying Microservice environments, using containerisation technologies such as Kubernetes and Docker
  • Experience in designing and implementing Observability tech stacks
  • Have championed the benefits of Observability to Engineering teams
  • Can architect the design of SLO/SLI implementation that balances the needs of different teams
  • Familiar with cost analysis of Observability metrics gathering, Engineering effort, and tooling
  • Experience building and supporting large-scale distributed systems that back a consumer app or website with associated requirements of performance, security and disaster recovery
  • Experience with implementing IaC (e.g. CloudFormation, Terraform etc.), automation tooling (e.g. Puppet, Ansible etc.), CI/CD (e.g. Jenkins, Travis CI, GitLab etc.)
  • Able to effectively communicate technical ideas to and collaborate with both technical and non-technical peers
Job Responsibility
  • Identifying opportunities to improve and scale our infrastructure for performance, observability, maintainability, and cost, by creating innovative solutions
  • Leading our efforts to build an observability function that incorporates application metrics, application transaction tracking, and event log management
  • Driving the processes to maintain resilient, scalable and cost-effective infrastructure
  • Working with other Engineering teams to provide infrastructure solutions that meet their ongoing requirements
  • Building tools focused on measuring, monitoring and alerting, with an eye towards self-service in order to promote Engineers’ ownership of observability
  • Reacting quickly to changing customer and business needs
  • Participate in on-call rota
  • Mentoring junior talent
What we offer
  • Work in a fun and supportive environment with regular team events
  • Excellent career progression
  • Structured learning environment
  • Competitive salary and company ESOP
  • Private health insurance
  • 26 days annual leave
  • Fulltime

Senior Site Reliability Engineer

We're looking for a Senior Site Reliability Engineer for our Currents team, resp...
Location:
United States, Austin
Salary:
129600.00 - 232200.00 USD / Year
Braze
Expiration Date
Until further notice
Requirements
  • Bachelor’s in Computer Science, Software Engineering, or a related STEM field
  • Five (5) years of experience in any role/occupation/position involving software engineering or site reliability engineering
  • Experience using distributed systems to deploy and monitor live applications such as Kubernetes or Docker Swarm
  • Experience working with alerting software (Sentry, Datadog, and/or PagerDuty)
  • Experience utilizing programming languages (Java, Kotlin, and/or Ruby) to understand and contribute to the codebase
  • Experience storing data in relational and non-relational databases such as Postgres and MongoDB
  • Experience with data streaming or queuing systems to build data pipelines with technologies like Kafka, Sidekiq or SQS and SNS
  • Experience leveraging continuous integration tools such as Jenkins or Buildkite
  • Experience collaborating with engineers through pull requests and code reviews in version control software such as GitHub or GitLab
Job Responsibility
  • Solve live performance and reliability issues and prevent their recurrence
  • Write and review code, educating engineers and building a culture of reliability
  • Practice sustainable incident response and blameless postmortems
  • Define and enable standards for monitoring, reliability, and performance
  • Bridge the gap between infrastructure and platform engineering teams
  • Support and improve services by planning for scale and reliability
  • Guide junior engineers in SRE best practices, software engineering, and agile project leadership
What we offer
  • Competitive compensation that may include equity
  • Retirement and Employee Stock Purchase Plans
  • Flexible paid time off
  • Comprehensive benefit plans covering medical, dental, vision, life, and disability
  • Family services that include fertility benefits and equal paid parental leave
  • Professional development supported by formal career pathing, learning platforms, and a yearly learning stipend
  • A curated in-office employee experience, designed to foster community, team connections, and innovation
  • Opportunities to give back to your community, including an annual company-wide Volunteer Week and donation matching
  • Employee Resource Groups that provide supportive communities within Braze
  • Fulltime

Senior Site Reliability Engineer

We're looking for a Senior Site Reliability Engineer for our Currents team, resp...
Location:
United States, San Francisco
Salary:
129600.00 - 232200.00 USD / Year
Braze
Expiration Date
Until further notice
Requirements
  • Bachelor’s in Computer Science, Software Engineering, or a related STEM field
  • Five (5) years of experience in any role/occupation/position involving software engineering or site reliability engineering
  • Experience using distributed systems to deploy and monitor live applications such as Kubernetes or Docker Swarm
  • Experience working with alerting software (Sentry, Datadog, and/or PagerDuty)
  • Experience utilizing programming languages (Java, Kotlin, and/or Ruby) to understand and contribute to the codebase
  • Experience storing data in relational and non-relational databases such as Postgres and MongoDB
  • Experience with data streaming or queuing systems to build data pipelines with technologies like Kafka, Sidekiq or SQS and SNS
  • Experience leveraging continuous integration tools such as Jenkins or Buildkite
  • Experience collaborating with engineers through pull requests and code reviews in version control software such as GitHub or GitLab
Job Responsibility
  • Solve live performance and reliability issues and prevent their recurrence
  • Write and review code, educating engineers and building a culture of reliability
  • Practice sustainable incident response and blameless postmortems
  • Define and enable standards for monitoring, reliability, and performance
  • Bridge the gap between infrastructure and platform engineering teams
  • Support and improve services by planning for scale and reliability
  • Guide junior engineers in SRE best practices, software engineering, and agile project leadership
What we offer
  • Competitive compensation that may include equity
  • Retirement and Employee Stock Purchase Plans
  • Flexible paid time off
  • Comprehensive benefit plans covering medical, dental, vision, life, and disability
  • Family services that include fertility benefits and equal paid parental leave
  • Professional development supported by formal career pathing, learning platforms, and a yearly learning stipend
  • A curated in-office employee experience, designed to foster community, team connections, and innovation
  • Opportunities to give back to your community, including an annual company-wide Volunteer Week and donation matching
  • Employee Resource Groups that provide supportive communities within Braze
  • Fulltime

Senior Site Reliability Engineer

We're looking for a Senior Site Reliability Engineer for our Currents team, resp...
Location:
United States, New York City
Salary:
129600.00 - 232200.00 USD / Year
Braze
Expiration Date
Until further notice
Requirements
  • Bachelor’s in Computer Science, Software Engineering, or a related STEM field
  • Five (5) years of experience in any role/occupation/position involving software engineering or site reliability engineering
  • Experience must include: Using distributed systems to deploy and monitor live applications such as Kubernetes or Docker Swarm
  • Working with alerting software (Sentry, Datadog, and/or PagerDuty)
  • Utilizing programming languages (Java, Kotlin, and/or Ruby) to understand and contribute to the codebase
  • Storing data in relational and non-relational databases such as Postgres and MongoDB
  • Data streaming or queuing systems to build data pipelines with technologies like Kafka, Sidekiq or SQS and SNS
  • Leveraging continuous integration tools such as Jenkins or Buildkite
  • Collaborating with engineers through pull requests and code reviews in version control software such as GitHub or GitLab
Job Responsibility
  • Solve live performance and reliability issues and prevent their recurrence
  • Write and review code, educating engineers and building a culture of reliability
  • Practice sustainable incident response and blameless postmortems
  • Define and enable standards for monitoring, reliability, and performance
  • Bridge the gap between infrastructure and platform engineering teams
  • Support and improve services by planning for scale and reliability
  • Guide junior engineers in SRE best practices, software engineering, and agile project leadership
What we offer
  • Competitive compensation that may include equity
  • Retirement and Employee Stock Purchase Plans
  • Flexible paid time off
  • Comprehensive benefit plans covering medical, dental, vision, life, and disability
  • Family services that include fertility benefits and equal paid parental leave
  • Professional development supported by formal career pathing, learning platforms, and a yearly learning stipend
  • A curated in-office employee experience, designed to foster community, team connections, and innovation
  • Opportunities to give back to your community, including an annual company-wide Volunteer Week and donation matching
  • Employee Resource Groups that provide supportive communities within Braze
  • Fulltime

Site Reliability Engineer

As a member of Kalshi’s engineering team, you’ll help build the next-generation ...
Location:
United States, New York
Salary:
100000.00 - 250000.00 USD / Year
Kalshi
Expiration Date
Until further notice
Requirements
  • 4+ years of software engineering experience
  • Experience designing, building, scaling, and maintaining production services and service-oriented architectures
  • Strong system design, coding, debugging, performance-tuning, and observability skills
  • High-quality coding practices with strong testing discipline
  • Excellent written and verbal communication
  • Comfort working transparently across teams
  • Strong interpersonal skills across junior-to-principal engineering levels
  • Ability to think clearly under pressure and dive into any layer of the stack
  • Passion for building an open financial system that connects the world
  • Willingness to participate in on-call rotations and swiftly resolve issues
Job Responsibility
  • Improve observability, reliability, and service availability by defining and measuring key metrics
  • Build automation and systems that eliminate toil and reduce operational burden
  • Collaborate with core infrastructure engineers to performance-tune and optimize cloud deployments (Docker, Terraform, Kubernetes, EC2, etc.)
  • Partner with product teams to minimize service disruptions and automate incident response
  • Identify and analyze reliability problems across the stack, designing and implementing software for significant, long-term improvements
  • Mentor engineers and drive a culture where reliability is a core engineering value
  • Write high-quality, well-tested code that supports internal and external customer needs
  • Debug complex technical issues and improve system usability, operability, and diagnosability
  • Review feature designs across the company and ensure security, safety, scalability, and architectural clarity
  • Build and maintain integrations with third-party vendors
What we offer
  • equity and benefits
  • Fulltime

Principal SRE

The Principal Site Reliability Engineer will be a senior technical expert respon...
Location:
India, Pune
Salary:
Not provided
Barclays
Expiration Date
Until further notice
Requirements
  • 12+ years in software engineering or infrastructure roles
  • at least 5 years focused on reliability engineering or SRE
  • proven experience building and operating fault-tolerant, highly available systems at scale
  • strong knowledge of distributed systems, resiliency patterns (circuit breakers, retries, failover), and disaster recovery strategies
  • expertise across infrastructure (compute, storage, networking), application architecture, databases, and integration patterns
  • ability to troubleshoot complex technical issues across distributed systems and perform deep root cause analysis
  • skilled at working with development, operations, and architecture teams to embed reliability into design and delivery
Job Responsibility
  • Drive strategies to improve reliability, maintainability, and scalability across payment flows and platform components
  • conduct deep technical assessments of system architectures, identifying risks and recommending improvements for fault tolerance and disaster recovery
  • act as a senior escalation point for production incidents, lead RCA, and implement permanent fixes to prevent recurrence
  • define and enforce reliability patterns, frameworks, and best practices
  • advocate and implement chaos engineering principles to validate system resilience under real-world failure scenarios
  • design and implement full-stack observability solutions, including metrics, logging, distributed tracing, and alerting
  • develop automation for failover, capacity management, and self-healing mechanisms to reduce operational risk
  • partner with development, infrastructure, and production support teams to embed reliability into the SDLC
  • analyze service risk assessments and production incidents to identify systemic issues and drive long-term improvements
  • promote operational excellence and a mindset of designing for failure across all engineering teams
What we offer
  • Competitive holiday allowance
  • Life assurance
  • Private medical care
  • Pension contribution
  • Fulltime

Senior UI Engineer

Engineer the future of global finance. At Citi, our Tech team doesn’t just suppo...
Location:
United Kingdom, London
Salary:
Not provided
Citi
Expiration Date
Until further notice
Requirements
  • Significant progressive experience in backend software development, with a proven track record of owning the design and delivery of complex, large-scale software projects as a senior individual contributor
  • Deep, hands-on expertise and architectural understanding of enterprise-level middleware technologies including Java, Spring Boot, Kafka, Microservices architecture, GraphQL, and NoSQL databases. Demonstrated experience with high-volume, low-latency distributed systems. Experience with Apache Flink is a significant advantage
  • Demonstrated ability to architect, design, and implement highly scalable, resilient, secure, and performant distributed systems
  • Expert-level understanding of the modern Software Development Lifecycle (SDLC), CI/CD pipelines, DevSecOps, and Site Reliability Engineering (SRE) best practices, with extensive, hands-on experience in their practical application
  • Exceptional verbal and written communication skills, capable of articulating complex technical concepts to diverse audiences and working effectively with business and technical stakeholders to drive positive outcomes
  • Bachelor’s degree in Computer Science, Engineering, or a related technical field is required
  • a Master’s degree or equivalent advanced professional qualifications are a plus
Job Responsibility
  • Actively contribute to and uphold the long-term technical vision and architectural roadmap for core platforms within the Banking Technology middleware ecosystem, encompassing Java, Spring Boot, Kafka, Microservices, GraphQL, and NoSQL databases. Implement and advocate for organizational engineering standards, best practices, and architectural patterns to ensure scalability, reliability, security, and maintainability across all engineering initiatives
  • Lead by example in coding, design, and problem-solving. Mentor and provide technical guidance to senior and junior software engineers, fostering a culture of innovation, continuous learning, and technical excellence. Share knowledge, best practices, and innovative solutions with the team
  • Apply and champion DevSecOps and Site Reliability Engineering (SRE) principles in daily work, ensuring high standards of system availability, performance, security, and operational efficiency for critical production platforms. Proactively identify and address technical debt, mitigate system risks, and implement robust disaster recovery capabilities
  • Act as a primary technical advisor to senior business leaders and product owners, translating complex business requirements into clear, actionable technical designs and innovative solutions. Effectively articulate technical insights, architectural decisions, and development progress to diverse audiences
  • Actively drive the exploration, evaluation, and hands-on application of emerging technologies, advanced architectural patterns, and innovative solutions (e.g., Apache Flink, Artificial Intelligence) to enhance product offerings and improve engineering productivity
  • Collaborate extensively with other engineers and technical leads across engineering, product management, and operations to ensure alignment of technical designs, seamless integration of solutions, and achievement of broader organizational goals. Influence technical decisions through deep expertise and well-reasoned arguments
  • Take ultimate accountability for the successful, on-time delivery of complex, high-quality, and user-centric software components. Uphold rigorous engineering standards through thorough design, code, and security reviews, and contribute to comprehensive technical documentation, ensuring a culture of engineering excellence
What we offer
  • 27 days annual leave (plus bank holidays)
  • A discretionary annual performance-related bonus
  • Private Medical Care & Life Insurance
  • Employee Assistance Program
  • Pension Plan
  • Paid Parental Leave
  • Special discounts for employees, family, and friends
  • Access to an array of learning and development resources
  • Fulltime