Reliability Engineer

West Fraser

Location:
United States, Dudley

Contract Type:
Not provided

Salary:
Not provided

Job Description:

The primary responsibility of this position is to improve plant uptime and the efficiency of the machinery and processes. This position also helps drive new ideas, systems, operational excellence, and continuous improvement across the site.

Job Responsibility:

  • Conduct and coordinate investigations of machine failures, including analyzing failure mechanisms and implementing solutions to improve reliability
  • Work with the maintenance teams to identify work practices and other opportunities to prevent machine failures
  • Develop, implement, and optimize preventative and predictive maintenance routes
  • Implement best practices for lubrication, including the selection and storage of lubricants
  • Analyze plant data to identify areas of focus for the maintenance team, areas of opportunity to reduce downtime, downtime trends by department and machine center, and other opportunities to improve plant performance
  • Train the maintenance team to improve skills and knowledge
  • Redesign machinery and other components when necessary to improve plant reliability and lower operating costs
  • Utilize the mill’s computerized maintenance management system to manage lubrication and other maintenance activities
  • Assist in the planning and execution of maintenance activities
  • Develop and maintain effective systems for improving plant reliability
  • Apply technology such as vibration analysis, thermography, oil analysis, and other predictive techniques to improve the mill’s reliability program
  • Identify other opportunities across the site to improve efficiency, lower cost, and reduce waste
  • Special projects and other duties as assigned

Requirements:

  • Mechanical, Industrial, Electrical, or similar Engineering Degree or a strong combination of experience and education
  • Relevant experience with industrial manufacturing equipment and/or trades experience is an asset
  • Demonstrated quantitative analysis skills and proficiency in working with application developers, IT, and other technical domains
  • Experience in communicating and presenting data and technical concepts to audiences with varying backgrounds
  • Self-directed and motivated
  • Strong analytical and problem-solving abilities
  • Willingness to adapt and thrive in a collaborative team environment
  • Effective verbal and written communication skills
  • Computer proficiency in MS Excel, Word, PowerPoint, and Maximo
  • Previous industry experience is preferred

What we offer:

  • Benefits starting Day 1
  • On-the-job training
  • A culture that strongly believes in promoting from within
  • Medical
  • Dental
  • 401k with company match plus an additional retirement contribution
  • Employee stock purchase plan
  • Life Insurance
  • Disability Insurance
  • Paid vacations and holidays

Additional Information:

Job Posted:
January 04, 2026

Employment Type:
Fulltime
Work Type:
On-site work

Similar Jobs for Reliability Engineer

Staff Software Engineer, Reliability

Join us in building the future of finance. Our mission is to democratize finance...
Location:
United States, Menlo Park
Salary:
217000.00 - 255000.00 USD / Year
Robinhood
Expiration Date:
Until further notice
Requirements:
  • 8+ years of experience designing and scaling distributed systems in production environments
  • Deep technical expertise in one or more programming languages (e.g., Python, Go, C++) and strong systems engineering fundamentals
  • Experience leading major infrastructure or reliability initiatives across multiple teams or domains
  • Track record of improving reliability metrics such as SLO adherence, MTTD/MTTR, or cost efficiency at scale
  • Strong mentorship and communication skills, with a focus on collaboration, clarity, and impact
Job Responsibility:
  • Develop and build software, infrastructure and tools that improve observability, alerting, incident response, and system readiness
  • Serve as a technical leader and reliability domain expert across multiple teams, driving architectural decisions and cross-functional initiatives
  • Design and lead large-scale reliability efforts that impact Robinhood’s most critical systems and services
  • Lead Production Readiness Reviews, championing best practices in pre-production testing, SLO development, and incident response metrics
  • Mentor engineers, foster a reliability-first culture, and drive long-term improvements that reduce operational overhead and improve system health
What we offer:
  • Performance-driven compensation with multipliers for outsized impact, bonus programs, equity ownership, and 401(k) matching
  • 100% paid health insurance for employees with 90% coverage for dependents
  • Lifestyle wallet - a highly flexible benefits spending account for wellness, learning, and more
  • Employer-paid life & disability insurance, fertility benefits, and mental health benefits
  • Time off to recharge including company holidays, paid time off, sick time, parental leave, and more
  • Exceptional office experience with catered meals, events, and comfortable workspaces
Employment Type:
Fulltime

Senior AI Site Reliability Engineer

At Schwab, you will build a rewarding career while making a difference in the li...
Location:
United States, San Francisco
Salary:
190000.00 - 270000.00 USD / Year
Charles Schwab
Expiration Date:
January 20, 2026
Requirements:
  • 8+ years of software development or reliability engineering experience, with 4+ years as a hands-on senior engineer in startups and/or large organizations
  • Bachelor’s degree in Computer Science or related field
  • 5+ years of experience building and operating complex products from scratch and running them in production
  • 3+ years of experience supporting applications that use Artificial Intelligence (AI) models to deliver real business impact
  • 3+ years of experience building and maintaining data pipelines and infrastructure for large datasets
  • 3+ years of experience with containers and cloud-native applications, and the ability to operationalize them in the public cloud with infrastructure as code
  • Experience implementing monitoring, alerting, and incident response for large-scale distributed systems
  • Proven track record in driving reliability, scalability, and performance improvements for production AI systems
Job Responsibility:
  • Design, implement, and manage the reliability and operational excellence of GenAI applications and platforms
  • Work closely with architects, engineers, and business leaders to align reliability practices with Schwab’s enterprise strategy
  • Mentor and coach junior engineers, helping to build strong operational practices and foster a culture of continuous improvement
  • Lead by example in solving complex reliability challenges, advancing SRE standards, and driving rapid iteration from concept to production
What we offer:
  • 401(k) with company match and Employee stock purchase plan
  • Paid time for vacation and volunteering, plus a 28-day sabbatical after every 5 years of service for eligible positions
  • Paid parental leave and family building benefits
  • Tuition reimbursement
  • Health, dental, and vision insurance
  • Bonus or incentive opportunities
Employment Type:
Fulltime

Reliability & Maintainability Engineering Manager

At Boeing, we innovate and collaborate to make the world a better place. We’re c...
Location:
United States, Everett; Renton
Salary:
147050.00 - 198950.00 USD / Year
Boeing
Expiration Date:
January 16, 2026
Requirements:
  • Bachelor of Science degree from an accredited course of study in engineering, engineering technology (includes manufacturing engineering technology), chemistry, physics, mathematics, data science, or computer science
  • 5+ years of experience leading engineering teams in R&M or related functional areas
  • Knowledge of the basic Principles, Processes and Lifecycle of Systems Engineering
  • Understanding of the concept of Technical Performance Measures (a customer-centric view of product performance)
  • Knowledge of basic definitions of Reliability, Maintainability, Durability, and Availability
  • General knowledge of probability & statistics and the basis of such in Reliability & Safety analysis
  • Knowledge of System Modeling methods and relation to R&M modeling & analysis (Model Based Engineering)
  • High level knowledge of Airplane Systems and Structures of commercial or military airplanes
  • Demonstrated ability to work in a multi-discipline engineering environment
Job Responsibility:
  • Develops project plans aligned to an Airplane Development Program and R&M strategy and objectives
  • Implements plans to ensure business, technical and customer requirements are achieved
  • Develops and monitors appropriate metrics to ensure performance to plan
  • Provides technical direction and guidance to the team regarding processes, tools, technology and deliverables
  • Ensures team products and processes meet customer, company, and regulatory requirements for quality and safety
  • Coaches, counsels, mentors and provides developmental opportunities to improve employee satisfaction and retain a skilled and motivated team
  • Forecasts resource needs, negotiates them with internal customers and other R&M managers, and recruits personnel if needed
  • Collaborates with other SEIT managers and team members
  • Establishes partnerships and good working relationships with internal customers, stakeholders, peers, and direct reports
What we offer:
  • Generous company match to your 401(k)
  • Industry-leading tuition assistance program pays your institution directly
  • Fertility, adoption, and surrogacy benefits
  • Up to $10,000 gift match when you support your favorite nonprofit organizations
  • Relocation based on candidate eligibility
  • Opportunity to enroll in a variety of benefit programs, generally including health insurance, flexible spending accounts, health savings accounts, retirement savings plans, life and disability insurance programs, and a number of programs that provide for both paid and unpaid time away from work
Employment Type:
Fulltime

Senior Reliability Engineer - PCBA, Harness & Connectors

We are looking for a Senior Reliability Engineer in charge of developing and exe...
Location:
United States, San Jose
Salary:
150000.00 - 225000.00 USD / Year
Figure
Expiration Date:
Until further notice
Requirements:
  • 5+ years of experience in relevant reliability engineering areas
  • Bachelor's degree or higher in relevant science and engineering fields
  • Strong knowledge of environmental reliability test principles, models, and methodologies, such as high temperature high humidity, thermal cycle/shock, mechanical vibration/shock
  • Strong knowledge of industry test standards such as AEC-Q, JEDEC, and IPC
  • Strong knowledge of electrical circuits, PCBA design and relevant SW tools (e.g. Altium)
  • Strong knowledge of PCBA, harness and connector failure modes, mechanisms, and FA techniques
  • Hands-on experience with field reliability risk analysis and failure prediction methods
  • Hands-on experience with Weibull++, JMP, or other reliability statistical analysis software
  • Hands-on experience with electronic circuit debugging and relevant tools, e.g. source meter, oscilloscope
  • Hands-on experience with a 3D CAD tool (e.g. CATIA)
Job Responsibility:
  • Work with cross-functional teams, own hardware reliability requirements and validation strategy
  • Develop and execute accelerated life tests for PCBAs, electronic components, electrical harness and connectors
  • Lead DFMEA efforts with design engineers to assess design risks, impacts, controls, and corrective actions
  • Design reliability test flows and procedures, communicate with internal and external/CM teams to execute tests and report results
  • Work with test engineers to design setup and fixtures used in reliability testing
  • Guide and support PCBA, harness, connector failure analysis, design of experiments (DOEs) and corrective action processes with cross-functional teams
  • Analyze field data, assess field risks, and design tests that correlate to field usage conditions
Employment Type:
Fulltime

Reliability Engineer

The Reliability Engineer is responsible for developing and leading asset reliabi...
Location:
United States, Bennettsville
Salary:
Not provided
Domtar
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s degree in Mechanical Engineering or related technical field
  • Minimum five (5) years of experience in maintenance, reliability, or engineering within manufacturing or heavy industrial environments (pulp and paper experience preferred)
  • Strong knowledge of RCM, FMEA, RCFA, CMMS systems, and predictive maintenance technologies
  • Demonstrated commitment to safety and continuous improvement
Job Responsibility:
  • Lead the development and execution of precision, preventive, and predictive maintenance strategies that improve equipment reliability
  • Champion Root Cause Problem Elimination (RCPE) and Failure Mode & Effects Analysis (FMEA) to proactively address equipment failures
  • Manage and optimize condition-based monitoring programs, including vibration, infrared, oil analysis, and ultrasound technologies
  • Establish and maintain robust systems and tools that enable maintenance and operations teams to monitor and interpret equipment and process health data effectively
  • Optimize maintenance strategies using asset criticality and reliability data to focus efforts on high-impact equipment
  • Analyze failure data and trends to identify systemic issues and drive continuous improvement initiatives
  • Collaborate with planning and scheduling teams to ensure timely and efficient execution of maintenance activities aligned with reliability goals
  • Serve as a subject matter expert on reliability tools, CMMS platforms, and emerging technologies
  • Develop and deliver training and communications to enhance reliability awareness and engagement among maintenance and operations personnel
  • Monitor and report on key reliability and maintenance KPIs, such as MTBF, MTTR, and OEE
What we offer:
  • Competitive compensation
  • A supportive working environment
  • Rewarding career paths
  • Plenty of opportunities for learning and growth
Employment Type:
Fulltime

Database Reliability Engineer

We are committed to providing our customers with reliable and secure services at...
Location:
Netherlands
Salary:
Not provided
ClickHouse
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s or Master’s degree in Computer Science or a related field
  • At least 5 years of experience in Reliability Engineering, QA or customer facing engineering
  • Previous experience operating ClickHouse or other SQL databases in production
  • Excellent understanding of distributed database internals and SQL; ClickHouse experience in particular is a major plus
  • Scripting experience with Shell or Python, and ability to read and understand C++ code
  • Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform
  • You are a strong problem-solver and have solid production debugging skills
  • You thrive in a fast-paced environment as part of a global team, and you see yourself as a partner with the business with the shared goal of moving the business forward
  • You have a high level of responsibility, ownership, and accountability
  • Excellent communication skills
Job Responsibility:
  • Continuously improve the reliability and performance of ClickHouse core
  • Improve and create metrics and alerts for ClickHouse to be able to identify and prevent problems in production before they affect customers
  • Dig deeper into the most common problems encountered by customers in ClickHouse core to identify their root causes, submit bug fixes and issue reports, and suggest improvements
  • Enhance and refine incident response processes and post-mortem analysis for ClickHouse core related outages including working with support and Cloud teams to communicate to the impacted customers
  • Plan, enable, and drive Chaos initiatives across Engineering teams, based upon internal priorities
  • Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize customer impact
What we offer:
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Employment Type:
Fulltime

Database Reliability Engineer

We are committed to providing our customers with reliable and secure services at...
Location:
Germany
Salary:
Not provided
ClickHouse
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s or Master’s degree in Computer Science or a related field
  • At least 5 years of experience in Reliability Engineering, QA or customer facing engineering
  • Previous experience operating ClickHouse or other SQL databases in production
  • Excellent understanding of distributed database internals and SQL; ClickHouse experience in particular is a major plus
  • Scripting experience with Shell or Python, and ability to read and understand C++ code
  • Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform
  • You are a strong problem-solver and have solid production debugging skills
  • You thrive in a fast-paced environment as part of a global team, and you see yourself as a partner with the business with the shared goal of moving the business forward
  • You have a high level of responsibility, ownership, and accountability
  • Excellent communication skills
Job Responsibility:
  • Continuously improve the reliability and performance of ClickHouse core
  • Improve and create metrics and alerts for ClickHouse to be able to identify and prevent problems in production before they affect customers
  • Dig deeper into the most common problems encountered by customers in ClickHouse core to identify their root causes, submit bug fixes and issue reports, and suggest improvements
  • Enhance and refine incident response processes and post-mortem analysis for ClickHouse core related outages including working with support and Cloud teams to communicate to the impacted customers
  • Plan, enable, and drive Chaos initiatives across Engineering teams, based upon internal priorities
  • Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize customer impact
What we offer:
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites

Site Reliability Engineering Manager

Hewlett Packard Enterprise (HPE) is looking for a Site Reliability Engineering M...
Location:
India, Bangalore
Salary:
Not provided
Hewlett Packard Enterprise
Expiration Date:
Until further notice
Requirements:
  • 7–10 years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles
  • Minimum 2 years of experience managing or leading cloud operations teams
  • Deep understanding of cloud platforms (AWS, GCP, or Azure) and cloud-native architectures
  • Hands-on experience with Kubernetes, containers, infrastructure as code (e.g., Terraform), and configuration management tools
  • Strong foundation in observability (monitoring, logging, tracing), automation using Python, and incident response
  • Familiarity with modern CI/CD automation and tools
  • Excellent communication, stakeholder management, and team-building skills
  • Experience scaling SRE practices in high-growth or large-scale environments
  • Ability to balance long-term reliability initiatives with short-term delivery needs
Job Responsibility:
  • Lead and mentor a team of Site Reliability Engineers, supporting their growth, performance, and well-being
  • Own the reliability strategy for SASE cloud infrastructure systems, including incident management, SLIs/SLOs, and capacity planning
  • Partner with Engineering, Product, and Security teams to design and deliver highly available, scalable, and resilient cloud-native services
  • Guide the team in building automation, improving observability, and improving the operational efficiency of our cloud infrastructure
  • Drive adoption of best practices in monitoring, alerting, on-call operations, and runbook development
  • Build and maintain a strong engineering culture based on ownership, collaboration, and continuous learning
  • Define and track key reliability metrics, and report on team performance and system health to leadership
  • Contribute to hiring, onboarding, and career development for SREs
What we offer:
  • Health & Wellbeing benefits for physical, financial, and emotional wellbeing
  • Personal & Professional Development programs
  • Unconditional inclusion in the workplace
Employment Type:
Fulltime