CrawlJobs Logo

Reliability Engineer

United States, Philadelphia · Job Posted May 30, 2026
Apply Position
Job Link Share

Job Responsibility

  • Deploy, configure, and support Proscia’s container based application stack in on-premise customer environments
  • Own system reliability across customer installations, including uptime, performance, backup/recovery, and upgrade workflows
  • Diagnose and resolve production incidents—deep root cause analysis across application, container, host, storage, and networking layers, using AI alongside traditional debugging to correlate signals and cut through noise
  • Optimize performance for large image datasets and AI workloads running on customer-managed compute infrastructure
  • Improve installation automation, configuration management, and repeatability across diverse environments integrating agentic workflows in your day-to-day to keep pace with demands from Engineering
  • Develop and refine monitoring, logging, and alerting patterns appropriate for customer-hosted deployments
  • Collaborate closely with Engineering, Customer Success, and Support to translate field learnings into product and operational improvements
  • Create operational playbooks—written with the clarity and structure that makes them useful to teammates, customers, and the AI-augmented workflows the team relies on
  • Contribute to Proscia’s technical presence—whether through internal demos, engineering blog posts, or operational knowledge sharing that raises the bar for how the team works

Requirements

  • Deep hands-on experience deploying and operating containerized applications using container orchestration in production environments
  • Strong Linux systems expertise (process management, networking, storage, security hardening, performance tuning)
  • Expert troubleshooting skills in distributed systems across application, container, and infrastructure layers
  • Experience with enterprise networking—you can troubleshoot and recommend corrections in customer infrastructure. Comfortable operating software in customer-managed and on-premise environments
  • Experience supporting data-intensive systems, ideally involving large image files or compute-heavy workloads
  • Working knowledge of observability practices (logs, metrics, tracing) and pragmatic monitoring approaches in non-cloud-native environments
  • Comfort working directly with customers or customer-facing teams to resolve high-impact issues
  • You already use AI tools in your operational work, in troubleshooting, writing automation, analyzing logs, or however it fits your practice
  • A mindset aligned with Proscia’s values: ownership, speed, simplification, and a willingness to challenge the status quo
  • Experience building with or on top of LLMs, AI agents, or agentic pipelines
  • Demonstrated fluency applying AI tools to real operational problems beyond basic code completion
  • Familiarity with prompt engineering, tool use patterns, and evaluation of AI systems—you know when AI output is production-ready and when it needs different guardrails

Nice to have

  • Experience with healthcare or regulated environments
  • Exposure to Kubernetes (for hybrid or future-state deployments)
  • Experience with infrastructure automation or configuration management tools
  • Familiarity with database performance tuning for large datasets
  • Experience supporting GPU-enabled workloads
  • Open-source contributions, side projects, or a portfolio that shows how you think and build
  • Background that spans multiple domains or disciplines
  • Active in technical communities, forums, or meetups

What we offer

  • competitive pay
  • savings, schedule, and insurance options that promote long-term health and personal growth

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Reliability Engineer

8 matching positions

Reliability Engineer

This is a fantastic opportunity to join our famous Marmite site and become part ...
Location
Location
United Kingdom , Burton-on-Trent
Salary
Salary:
Not provided
unilever.com Logo
Unilever
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • NVQ L3 in Engineering and Post Apprenticeship skill level
  • Knowledge and proven experience working with process equipment ideally within a food manufacturing environment
  • Excellent communicator able to work well as part of a team
  • Instrumentation and Control Skills
  • Mechanical Skills (Pump, conveyor and valve repairs)
  • Previous experience report writing, conducting data analysis and mechanical calculation experience
  • Decision maker
  • Coach to the team using WCM/TPM tools
  • Previous experience in contractor management
  • Self-manage workload and priorities
Job Responsibility
Job Responsibility
  • Delivering the medium and long-term maintenance strategy for the factory
  • Running diagnostics and troubleshooting faults through root cause analysis
  • Defining preventive maintenance strategy to minimise breakdowns
  • Proactively recommend design improvements
  • Lead machine technical review to improve performance
  • Prioritising and resolving abnormalities of the line
  • Lead, coach, and mentor Shift Engineers and Technical Operators
What we offer
What we offer
  • Competitive salary
  • Pension scheme
  • Annual bonus
  • Subsidised gym membership
  • Discounted staff shop
  • Shares
  • Flexible working options
  • Family-friendly and inclusive workplace
  • Fulltime
Read More
Arrow Right
New

Reliability Engineer

Figure is an AI robotics company developing autonomous general-purpose humanoid ...
Location
Location
United States , San Jose
Salary
Salary:
120000.00 - 250000.00 USD / Year
figure.ai Logo
Figure
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum 2 years of work experience in reliability engineering
  • Bachelor's degree or higher in mechanical, electrical, materials engineering or related field
  • Strong knowledge of hardware failure modes and failure physics
  • Strong knowledge in developing accelerated life test specs and methods, such as high temperature high humidity, thermal cycle, vibration, shock, and load/use cycle tests
  • Familiarity with Weibull++, JMP, or related data analysis software
  • Experience in hardware assembly and instrumentation setup
Job Responsibility
Job Responsibility
  • Collect and analyze current and prospective product use cases to define hardware reliability specs and validation methods
  • Lead DFMEA efforts with design engineers to analyze and assess potential design risks, detection and mitigation methods
  • Develop and execute accelerated life tests for materials, modules, and systems, e.g. structural parts, joints, softgoods, robot head/torso, charger, EMC, full robot and robot ecosystem product
  • Work with test engineers to develop custom test fixtures, electronics, and codes
  • Working with cross-functional teams, own reliability test planning and preparation in-house and at supplier sites
  • Drive and manage internal/external resources and suppliers in executing reliability tests in each engineering build
  • Support failure analysis from reliability tests, fleet operations, and field uses
  • Document failures and work with design and manufacturing engineers to resolve issues
  • Conduct physics-based and statistical analysis on test and field failures to assess risk and provide actionable recommendations
  • Fulltime
Read More
Arrow Right

Reliability Engineer

At British Engineering Services, we pride ourselves on being the leading end to ...
Location
Location
United Kingdom , Newcastle upon Tyne
Salary
Salary:
37000.00 GBP / Year
besgroup.com Logo
BES Group
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Ideally a background in engineering within the marine, rail, or manufacturing environment
  • Level 1 Vibration Analysis or working towards this is essential
  • Experience or knowledge of Vibration Analysis
  • Experience or knowledge of condition monitoring hardware and software
  • A Mechanical Engineering qualification (level 3 upwards) is highly regarded
  • Flexibility to work away and travel as per business and customer requirements
  • Full UK driving license
Job Responsibility
Job Responsibility
  • Carry out condition based maintenance (CBM) techniques utilising vibration analysis, ultrasound, thermography and oil analysis
  • Perform data collection – the analysis of equipment performance, failure data and corrective maintenance history
  • Assess and report on machine performance and recommend improvements
  • Spec, set up and installation of online and wireless systems or remote sensors
  • Always provide the exceptional level of customer service expected from our team, whilst representing our brilliant company professionally
What we offer
What we offer
  • Company vehicle
  • Company Pension Scheme
  • Annual salary review
  • 25 days annual leave plus 8 bank holidays
  • An extra day’s holiday to take on Christmas Eve each year
  • Access to our buy and sell holiday scheme
  • Opportunity for flexible working
  • Electric Vehicle salary sacrifice scheme
  • Discounts and savings via our employee benefits portal
  • Health and wellbeing support via our Employee Assistance Programme
  • Fulltime
Read More
Arrow Right

Reliability Engineer

The position is responsible for bringing DevOps/SRE (Site Reliability Engineerin...
Location
Location
Poland , Warsaw
Salary
Salary:
165020.00 - 280980.00 PLN / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Software development experience scripting in Python/Shell
  • Understanding of SRE principles, DevOps, CI/CD pipeline and tools
  • Experience analyzing data to drive decisions
  • Strong analytical, algorithmic, and problem-solving skills
  • Excellent teamwork, proactive attitude, strong communication skills, both written and oral
Job Responsibility
Job Responsibility
  • Work in an agile software development environment, developing quality and scalable software solutions using leading-edge technologies
  • Work closely with developers, engineers and non-technology employees to help them be more productive with the use of the CI/CD tools
  • Collaborate with Citi Developer Services engineers to automate manual and repetitive processes, integrate services with AI (by building and maintaining MCP - Model Context Protocol servers), enhance system resiliency, and coordinate service issue investigations by deploying best practices
  • Automate manual activities, repetitive processes, reporting, controls, etc., configure and tune them
  • Continuously improve systems resiliency, reliability - through a design and development of software solutions and streamlined processes
  • Mitigate risk by analyzing the root cause of production issues, impacts to business, and required corrective actions
What we offer
What we offer
  • Employer paid Defined Contribution Pension Plan contribution of 6% of employee’s pensionable earnings (PPE Program)
  • Employer paid Private Medical Care Package for employees and Private Medical Care Packages for certain family members available at preferential rates
  • Employer paid Life Insurance Program for employees and Life Insurance for certain family members available at preferential rates
  • Employee Assistance Program financed by Employer
  • Paid Parental Leave Program (maternity and paternity leave
  • statutory and 2 weeks additional paid paternity leave)
  • Sport Card for employees subsidised via Social Benefits Fund and Sport Cards for certain family members available at preferential rates
  • Additional benefits from Company’s Social Benefit Fund, in particular: Holidays Allowance, support for sport and cultural activities, team building events
  • Additional day off for volunteering
  • Cafeteria/ flex benefit – a company benefits system which enables employees to select and purchase benefits offered by a provider and available for employees on the platform
  • Fulltime
Read More
Arrow Right

Reliability Engineer

Aurizon is seeking a Reliability Engineer to provide civil engineering services ...
Location
Location
Australia , Rockhampton / Mackay
Salary
Salary:
127556.00 AUD / Year
aurizon.com.au Logo
Aurizon
Expiration Date
June 19, 2026
Flip Icon
Requirements
Requirements
  • Bachelor of Civil Engineering
  • practical experience developing and delivering civil engineering solutions
  • confident working with guidance, applying sound judgement and building strong stakeholder relationships to deliver quality outcomes
  • develop and deliver civil engineering designs, specifications, and solutions across construction and maintenance activities
  • review, check, and resolve issues in engineering designs to improve system performance and ensure quality outcomes
  • plan and deliver work programs, including developing estimates, budgets, and schedules for project activities
  • contribute to the development and implementation of engineering standards, procedures, and best practice across the network
Job Responsibility
Job Responsibility
  • Organise and plan technical aspects of civil projects, including design, specifications, and safety in design for new and modified works
  • Review, check, and audit civil designs from internal teams and external providers to ensure quality and compliance
  • Prepare end-to-end project documentation, including requirements, procedures, testing, commissioning, and asset handover
  • Ensure designs align with business and industry standards, and contribute to developing new processes, standards, and work practices
What we offer
What we offer
  • Development and growth opportunities
  • Access to mentoring and development programs
  • Discounts on selected health insurance funds, personal travel, gyms, vehicles and retail brands
  • Parental leave program and super booster
  • Fulltime
Read More
Arrow Right

Reliability Engineer

We are seeking a talented and experienced Senior Reliability Engineer to join ou...
Location
Location
Belgium , Wavre
Salary
Salary:
57675.00 - 96125.00 EUR / Year
us.gsk.com Logo
GSK
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Degree in Engineering (Mechanical, Electrical, Industrial, or related field).
  • Minimum of 7 years of experience in reliability engineering, preferably in the pharmaceutical industry or a related field.
  • In-depth knowledge of reliability methodologies such as Failure Modes and Effects Analysis (FMEA), Root Cause Analysis (RCA), Reliability-Centered Maintenance (RCM), etc.
  • Proficiency in data analysis and use of Computerized Maintenance Management Systems (CMMS).
  • Excellent communication and teamwork skills.
  • Ability to manage multiple projects and priorities effectively.
Job Responsibility
Job Responsibility
  • Develop and implement reliability strategies (Local (MU) and transversal (Belgium)) to enhance equipment performance and reduce downtime.
  • Analyze reliability data to identify trends, root causes of failures, and recommend + implement and document corrective actions.
  • Collaborate with maintenance, production, and engineering teams to optimize preventive and predictive maintenance programs.
  • Perform deep analysis of Emergency events in order to propose and implement major improvements of the equipment availability
  • Conduct criticality analyses of equipment and processes to prioritize improvement efforts.
  • Participate in the design and implementation of new equipment and processes, ensuring that reliability principles are integrated from the start.
  • Support CAPEX projects to ensure reliability, sustainability, and maintainability are considered and integrated.
  • Prepare and present reliability points during L3 and L4 audits.
  • Train and mentor team members and non-reliability teams on reliability best practices.
  • Expert in the use and facilitation of DMAIC (Define, Measure, Analyze, Improve, Control) methodology for continuous improvement projects.
What we offer
What we offer
  • Competitive base salary
  • Annual bonus based on company performance
  • Flexible working options available for most roles
  • Learning and career development
  • Access to healthcare & wellbeing programmes
  • Employee recognition programmes
  • Fulltime
Read More
Arrow Right

Reliability Engineer

As a Reliability Engineer, you'll lead proactive engineering initiatives to maxi...
Location
Location
United Kingdom , Diss
Salary
Salary:
Not provided
jobs.360resourcing.co.uk Logo
360 Resourcing Solutions
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Apprenticeship trained engineer with experience in manufacturing
  • Strong mechanical and fault-finding skills
  • Experience with predictive maintenance & reliability engineering
  • Proven data analysis skills (Power BI preferred)
  • Ability to perform Root Cause Analysis (RCA)
  • A proactive, problem-solving mindset in a fast-paced manufacturing environment
  • Corrugated industry experience, OMP systems, RCM or CMMS knowledge, multi-skilled background (desirable)
Job Responsibility
Job Responsibility
  • Drive equipment reliability & downtime reduction through root cause analysis and corrective actions
  • Develop and optimise predictive & preventative maintenance (PPM) strategies
  • Use tools like Power BI & OMP to analyse performance and identify improvement opportunities
  • Apply predictive techniques (vibration, thermal imaging, oil analysis, etc.)
  • Support continuous improvement and long-term engineering strategy
  • Contribute to capital projects, upgrades, and innovation initiatives
  • Promote safe working practices across the site
What we offer
What we offer
  • Competitive salary & benefits, including annual leave, pension, and a Cycle to Work scheme
  • Ongoing training and development opportunities
  • 24/7 confidential support for you and your family
  • Flexible working options and family-friendly policies
  • Guaranteed interview for candidates meeting essential criteria (Disability Confident Employer)
  • Fulltime
Read More
Arrow Right

Reliability Engineer

LIFE AT DELEK CAREERS FAQs LOCATIONS SIGN IN / CREATE PROFILE Show More Options ...
Location
Location
United States , El Dorado
Salary
Salary:
Not provided
delekus.com Logo
Delek US
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4 year / Bachelor's Degree (Required)
  • In lieu of the above education requirements, an equivalent combination of education and experience may be considered
  • Two (2) or more years Oil & Gas, or related experience (Required)
  • No Licensure or Certification Required
  • Reliability Management
  • Asset Management
  • Fixed Equipment
  • Rotating Equipment
  • Pressue Control Devices
  • Pressure Vessels
Job Responsibility
Job Responsibility
  • Responsible for sustaining and continuously improving various mechanical components for equipment and tools
  • Ensures the safe, effective operations of the organization's production and supports continuous improvement
  • Manages reliability engineering projects
  • Performs analytical verification
  • Evaluates, tests and tracks results of reliability interventions
  • Initiates reporting for internal or third-party reported incidents
  • Creates, documents and follows up on corrective actions
  • Prepares routine reports and memos and coordinate communications across all necessary functional groups of the organization
What we offer
What we offer
  • up to a 10% match on 401K on your hire start, with a vesting timeline of only one year
  • medical benefits that start on day one with a 30% premium rebate annually
  • access to the Calm app for FREE
  • earn additional annual incentives as you set and achieve goals
  • Fulltime
Read More
Arrow Right