CrawlJobs Logo
Briefcase Icon
Category Icon

Reliability Engineer United States, Sunnyvale Jobs

4 Job Offers

Filters
Staff Site Reliability Engineer
Save Icon
Join our team as a Staff Site Reliability Engineer in Sunnyvale. You will own critical internal infrastructure, migrating systems and ensuring high availability. We seek expertise in cloud platforms (AWS/Azure/GCP), IaC (Terraform/Ansible), and building automated, reliable systems. Drive automati...
Location Icon
Location
United States , Sunnyvale
Salary Icon
Salary
175000.00 - 250000.00 USD / Year
figure.ai Logo
Figure
Expiration Date
Until further notice
Site Reliability Engineer SRE – ML platform
Save Icon
Join our team in Sunnyvale as a Site Reliability Engineer for our ML platform. You will design AWS cloud solutions and build MLOps pipelines using Kubernetes, Docker, and Python. This role involves deploying ML models and collaborating with data scientists, requiring strong expertise in Kubeflow,...
Location Icon
Location
United States , Sunnyvale
Salary Icon
Salary
Not provided
thirdeyedata.ai Logo
Thirdeye Data
Expiration Date
Until further notice
Reliability Engineer
Save Icon
Join Meta Reality Labs as a Reliability Engineer in Sunnyvale. You will ensure the reliability of cutting-edge AR/VR and wearable products. Develop test plans, conduct FMEA, and analyze failure mechanisms for new AI-native hardware. This role requires 5+ years of experience and offers a competiti...
Location Icon
Location
United States , Sunnyvale
Salary Icon
Salary
144000.00 - 204000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Reliability Engineer
Save Icon
Join Meta Reality Labs as a Reliability Engineer in Sunnyvale. You will ensure the reliability of cutting-edge AR/VR and wearable products. Develop test plans, conduct FMEA, and perform failure analysis using industry standards. This role requires a BS degree and 3+ years of experience in reliabi...
Location Icon
Location
United States , Sunnyvale
Salary Icon
Salary
118000.00 - 170000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Explore a dynamic and critical career path with Reliability Engineer jobs, a profession dedicated to ensuring systems and assets operate with maximum uptime, performance, and efficiency. Reliability Engineers are the guardians of operational integrity, applying engineering principles and data-driven analysis to prevent failures, optimize performance, and implement sustainable processes. This field broadly splits into two key domains: IT/Software Reliability and Industrial/Physical Asset Reliability, both united by the core mission of building and maintaining resilient systems. In the technology sector, often titled Site Reliability Engineer (SRE), professionals blend software engineering and systems administration to create scalable and highly reliable software platforms. Their general responsibilities include designing and automating infrastructure deployment, building robust monitoring and alerting systems, and managing incident response through on-call rotations. They focus on key service level indicators (SLIs) and objectives (SLOs) to measure and improve user experience. Typical tasks involve writing code for automation, conducting post-incident reviews (blameless postmortems), and collaborating with development teams to embed reliability into the software lifecycle from the start. Common requirements for these roles include proficiency in programming/scripting (e.g., Python, Go), expertise in cloud platforms (AWS, Azure, GCP), container orchestration (Kubernetes), and infrastructure-as-code tools, alongside a strong grasp of CI/CD pipelines and observability stacks. Conversely, in industrial settings like manufacturing, energy, or oil and gas, Reliability Engineers focus on physical assets such as rotating equipment, electrical systems, and production machinery. Their work is centered on predictive and preventive maintenance strategies. General duties involve analyzing equipment performance data, conducting Root Cause Failure Analysis (RCFA), developing maintenance procedures, and managing reliability-centered maintenance (RCM) programs. They use statistical analysis and reliability modeling to predict asset lifecycles, recommend improvements, and manage risk. Typical skills include a strong mechanical or electrical engineering foundation, knowledge of condition monitoring technologies (vibration analysis, thermography), familiarity with Computerized Maintenance Management Systems (CMMS), and expertise in process safety and lifecycle cost analysis. Across both domains, successful Reliability Engineers are systematic problem-solvers with a proactive mindset. They possess strong analytical skills to interpret complex data, excellent communication skills to collaborate across teams and justify investments, and a relentless focus on continuous improvement. Whether ensuring a global web service remains online or a refinery operates safely and efficiently, Reliability Engineer jobs are foundational to modern operational excellence. For those passionate about building systems that don't fail and optimizing performance through engineering, this profession offers a challenging and impactful career with opportunities spanning virtually every industry.

Filters

×
Category
Location
Work Mode
Salary