CrawlJobs Logo
Briefcase Icon
Category Icon

Reliability Engineer I Jobs

559 Job Offers

Filters
Site Reliability Engineer / Observability Engineer
Save Icon
Join Rackspace as a Site Reliability/Observability Engineer in Giza. Utilize your senior-level SRE and DevOps expertise with AWS and tools like Datadog or Splunk. You will build scalable systems, implement observability solutions, and automate deployments to ensure optimal application performance...
Location Icon
Location
Egypt , Giza
Salary Icon
Salary
Not provided
rackspace.com Logo
Rackspace
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Join a leading consultancy and work with Turkey's top companies as a Senior Site Reliability Engineer. You will design fault-tolerant systems, define SLOs/SLAs, and drive automation using Kubernetes and cloud technologies. This role offers career growth, certification opportunities, and hands-on ...
Location Icon
Location
Turkey , İstanbul
Salary Icon
Salary
Not provided
padran.com Logo
Padran Information Technologies Inc.
Expiration Date
Until further notice
Lead Service Reliability Engineer
Save Icon
Lead Service Reliability Engineer role in Singapore. Drive technical excellence using Python, Golang, and SRE principles. Enhance reliability with Kubernetes, Terraform, and advanced observability tools. Grow your career with tailored development programs in a collaborative environment.
Location Icon
Location
Singapore , Singapore
Salary Icon
Salary
Not provided
thoughtworks.com Logo
Thoughtworks
Expiration Date
Until further notice
Senior Service Reliability Engineer
Save Icon
Join Thoughtworks in Singapore as a Senior Service Reliability Engineer. You will champion SRE principles, focusing on automation, resilience, and system performance using Ansible, Terraform, and cloud platforms. Drive reliability improvements, manage incidents, and enhance observability within a...
Location Icon
Location
Singapore , Singapore
Salary Icon
Salary
Not provided
thoughtworks.com Logo
Thoughtworks
Expiration Date
Until further notice
Reliability Engineer
Save Icon
Join our UK team as a Reliability Engineer, performing advanced condition monitoring using vibration analysis and thermography. An engineering background in marine, rail, or manufacturing is ideal, with a Level 1 Vibration certification. Enjoy benefits like a car allowance, flexible working, and ...
Location Icon
Location
United Kingdom
Salary Icon
Salary
37000.00 GBP / Year
besgroup.com Logo
BES Group
Expiration Date
Until further notice
Site Reliability Engineer III
Save Icon
Join Zuora's Cloud Engineering team as a Site Reliability Engineer III in Chennai. You will ensure high-availability for SaaS platforms using AWS, Terraform, and Jenkins. This role requires 6-8 years of SRE experience with strong Linux and Python skills. We offer competitive compensation, flexibl...
Location Icon
Location
India , Chennai
Salary Icon
Salary
Not provided
zuora.com Logo
Zuora
Expiration Date
Until further notice
Software Engineer - Data Infra Reliability
Save Icon
Join Luma AI in Palo Alto as a Data Infra Reliability Engineer. You will ensure the resilience of petabyte-scale data pipelines using SRE principles. Key requirements include deep expertise in Terraform, Kubernetes, Python, and AWS/GCP. Automate infrastructure, harden data workflows, and define S...
Location Icon
Location
United States , Palo Alto
Salary Icon
Salary
220000.00 - 280000.00 USD / Year
lumalabs.ai Logo
Luma AI
Expiration Date
Until further notice
Site Reliability Engineer
Save Icon
Join Luma AI to architect the physical and digital foundation of AGI. As a Site Reliability Engineer, you will build and optimize massive-scale, multi-vendor GPU supercomputers in Palo Alto or London. Your elite HPC knowledge will design high-performance clusters, optimizing low-level networking ...
Location Icon
Location
United States; United Kingdom , Palo Alto; London
Salary Icon
Salary
170000.00 - 360000.00 USD / Year
lumalabs.ai Logo
Luma AI
Expiration Date
Until further notice
Software Engineer - Cloud FinOps & Reliability
Save Icon
Join our SRE team as a foundational Software Engineer specializing in Cloud FinOps & Reliability. You will own the financial health of a massive multi-cloud GPU infrastructure, using Python to build automation and optimize costs. This Palo Alto-based role requires deep cloud expertise and a passi...
Location Icon
Location
United States , Palo Alto
Salary Icon
Salary
120000.00 - 255000.00 USD / Year
lumalabs.ai Logo
Luma AI
Expiration Date
Until further notice
Software Engineer - Reliability
Save Icon
Join Luma as a Software Engineer - Reliability in Palo Alto. Architect and scale next-gen AI infrastructure across AWS and OCI. Utilize your deep Linux and system performance expertise to ensure high availability for GPU clusters. Thrive in a fast-paced role solving complex hardware/software chal...
Location Icon
Location
United States , Palo Alto
Salary Icon
Salary
170000.00 - 360000.00 USD / Year
lumalabs.ai Logo
Luma AI
Expiration Date
Until further notice
Software Engineer - Reliability GPU Infrastructure
Save Icon
Shape the future of creative AI as a Software Engineer for GPU Infrastructure at Luma AI. You will architect and own our massive-scale, multi-cloud and on-premise compute substrate. This role requires deep expertise in distributed systems and infrastructure as code, based in Palo Alto or London.
Location Icon
Location
United States; United Kingdom , Palo Alto; London
Salary Icon
Salary
170000.00 - 360000.00 USD / Year
lumalabs.ai Logo
Luma AI
Expiration Date
Until further notice
Product Infrastructure Engineer - Site Reliability
Save Icon
Join Zyphra as a Product Infrastructure Engineer - Site Reliability in Palo Alto. Design and maintain robust, scalable systems for ML workloads using IaC tools like Terraform. Ensure reliability, security, and observability while collaborating with ML and DevOps teams. Enjoy comprehensive benefit...
Location Icon
Location
United States , Palo Alto
Salary Icon
Salary
Not provided
zyphra.com Logo
Zyphra
Expiration Date
Until further notice
Executive Principal, Site Reliability Engineering (SRE) – DevOps
Save Icon
Lead our Site Reliability Engineering (SRE) and DevOps strategy as an Executive Principal in Irvine. You will guide multiple infrastructure teams, ensuring 24x7 operational excellence in a complex hybrid environment. This senior role requires deep expertise in automation, CI/CD, and platform reli...
Location Icon
Location
United States , Irvine
Salary Icon
Salary
180000.00 - 210000.00 USD / Year
haeaus.com Logo
Hyundai AutoEver America
Expiration Date
Until further notice
Principal Site Reliability Engineer (AI-first SRE)
Save Icon
Lead the AI-driven reliability transformation at Groupon as a Principal SRE. You will architect self-healing systems using AI/ML, GCP/AWS, and Kubernetes to ensure 99.9%+ availability. This role requires 10+ years of experience, expertise in AIOps, and offers a chance to shape scalable, predictiv...
Location Icon
Location
Salary Icon
Salary
Not provided
groupon.com Logo
Groupon
Expiration Date
Until further notice
Site Reliability Engineer
Save Icon
Join Dropbox as a Site Reliability Engineer to shape global infrastructure strategy. You'll design secure, scalable systems using AWS, Linux, and Terraform, while driving automation and observability projects. This role offers competitive benefits like flexible PTO and a comprehensive perks allow...
Location Icon
Location
Salary Icon
Salary
Not provided
dropbox.com Logo
Dropbox
Expiration Date
Until further notice
Junior Software Reliability & Safety Engineer
Save Icon
Seeking a Junior Software Reliability & Safety Engineer in Montreal or Toronto. This role requires a Bachelor's in Computer Science and 2+ years' experience with C/C++/ADA and embedded systems. You will apply functional safety standards (ISO 26262, EN 50128) in a collaborative, engineering-focuse...
Location Icon
Location
Canada , Montreal or Toronto
Salary Icon
Salary
Not provided
sector-group.net Logo
Sector Group
Expiration Date
Until further notice
Senior Database Reliability Engineer
Save Icon
Join our team as a Senior Database Reliability Engineer (DBRE) to manage and automate our critical cloud database infrastructure in the US. You'll leverage 7+ years of RDBMS experience, cloud expertise (AWS/GCP/Azure), and coding skills (Python, SQL, Terraform) to reduce toil and support new prod...
Location Icon
Location
United States
Salary Icon
Salary
146000.00 - 162000.00 USD / Year
pointclickcare.com Logo
PointClickCare
Expiration Date
Until further notice
Senior Database Reliability Engineer
Save Icon
Join our team in Mississauga as a Senior Database Reliability Engineer. You will manage and automate our critical cloud database infrastructure on AWS/GCP/Azure. Leverage your 7+ years of DB experience and skills in SQL, automation (Terraform, Ansible), and coding (Python, PowerShell). Enjoy comp...
Location Icon
Location
Canada , Mississauga
Salary Icon
Salary
136900.00 - 152100.00 CAD / Year
pointclickcare.com Logo
PointClickCare
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Join a leading team as a Senior Site Reliability Engineer in a hybrid London role. You will be the SME for Kubernetes and GCP, driving SRE culture while supporting applications at scale. This cloud-native position offers up to £95k, a strong pension, and bonus schemes for an expert in SRE and sof...
Location Icon
Location
United Kingdom , London
Salary Icon
Salary
85000.00 - 95000.00 GBP / Year
morson.com Logo
Morson Talent
Expiration Date
Until further notice
Senior Database Reliability Engineer
Save Icon
Join our multidisciplinary team in Levallois-Perret as a Senior Database Reliability Engineer. Ensure high availability for PostgreSQL/Aurora, Kafka, and Couchbase in a cloud-native, "you build it, you run it" environment. Leverage your 6+ years of expertise in SRE, IaC, and performance optimizat...
Location Icon
Location
France , Levallois-Perret
Salary Icon
Salary
Not provided
doctolib.fr Logo
Doctolib
Expiration Date
Until further notice

About the Reliability Engineer I role

Explore Reliability Engineer I jobs and launch a career dedicated to ensuring system integrity and operational excellence. A Reliability Engineer I is an entry-level professional focused on proactively preventing failures, optimizing performance, and maximizing the uptime of critical systems. This foundational role exists across diverse industries, from technology and software to manufacturing, energy, and industrial operations. While the specific systems vary—encompassing software applications, cloud infrastructure, or physical machinery like rotating equipment—the core mission is universal: to build and maintain resilient, efficient, and dependable operations through engineering principles.

Individuals in these roles typically engage in a blend of monitoring, analysis, automation, and collaboration. Common responsibilities include assisting in the design and implementation of monitoring and alerting systems to gain visibility into system health. They analyze performance data, incident reports, and maintenance records to identify patterns and potential points of failure. A significant part of the role involves contributing to automation efforts, writing scripts to manage infrastructure, deploy applications, or streamline repetitive operational tasks, thereby reducing manual toil and human error. Reliability Engineers also participate in incident response, helping to diagnose and resolve issues, and contribute to post-incident reviews to document root causes and implement preventive measures. They often work closely with development and operations teams to advocate for reliability standards, such as scalable architecture and robust fault tolerance, throughout a system's lifecycle.

Typical skills and requirements for Reliability Engineer I positions include a strong foundational understanding of engineering concepts, often supported by a bachelor's degree in computer science, engineering, or a related technical field. Key technical proficiencies often include scripting or programming languages (like Python, Bash, or Shell), familiarity with operating systems, and an introductory knowledge of relevant domain tools. For software-centric roles, this might mean basic knowledge of cloud platforms (AWS, Azure, GCP), containerization (Docker, Kubernetes), and CI/CD pipelines. For industrial roles, understanding mechanical systems, statistical analysis, and reliability methodologies (like Root Cause Analysis or Failure Mode and Effects Analysis) is crucial. Regardless of the domain, successful candidates demonstrate a problem-solving mindset, a passion for automation, keen analytical abilities, and effective communication skills to collaborate across teams.

Pursuing Reliability Engineer I jobs is ideal for those who enjoy the intersection of development and operations, possess a meticulous attention to detail, and derive satisfaction from building systems that users and businesses can depend on every day. It is a career path built on continuous learning and offers a clear trajectory for growth into more senior engineering and specialist positions.

Filters

×
Countries
Category
Location
Work Mode
Salary