CrawlJobs Logo

Failure Analysis Engineer

etched.com Logo

Etched

Location Icon

Location:
Taiwan , Taipei

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Etched is hiring a Failure Analysis Engineer to own the end-to-end debug process across our full hardware stack: chip, board, and rack-scale systems. You will be responsible for rapidly diagnosing, triaging, and resolving hardware failures; determining whether issues originate in the chip, board, or rack infrastructure; and driving resolution with the appropriate team. This is a highly cross-functional role, working closely with US-based hardware and silicon teams to build and refine debug playbooks as production scales. The ideal candidate has deep EE fundamentals, systems-level debugging experience, and the ability to solve hard problems under pressure.

Job Responsibility:

  • Own failure triage across the stack. Receive field and production failures, isolate whether the root cause is chip, board-level, or system/rack-level, and route to the appropriate team with a clear problem statement
  • Drive root cause analysis using electrical test equipment (oscilloscopes, logic analyzers, multimeters) and system-level diagnostics to identify failure mechanisms and determine corrective actions
  • Build and refine debug processes. Partner with US hardware counterparts to document debug flows for different failure modes, creating repeatable playbooks that scale with production volume
  • Debug rack-level issues. Troubleshoot communication failures between rack managers, CDUs, and system components. Understand how thermal, power, and network infrastructure interact at the rack scale
  • Interface with BMC and system firmware. Use Linux command line and BMC interfaces to pull logs, run diagnostics, and validate system health during failure investigations
  • Close the loop on quality. Feed failure trends and root cause findings back to design, manufacturing, and operations teams to drive systemic improvements

Requirements:

  • Bachelor’s or Master’s degree in Electrical Engineering or a related field
  • Fluency in oscilloscopes, signal integrity basics, power delivery, and board-level debug
  • Systems-level thinking. Strong understanding of how servers work end-to-end: BMC, BIOS, OS, thermals, and power sequencing. Can debug issues that span multiple subsystems
  • Linux command line proficiency. Comfortable with CI pulling logs, running scripts, and navigating server environments from the terminal
  • Strong communication skills across teams. You can translate a complex hardware failure into a clear problem statement for silicon, firmware, or mechanical teams. You've worked across time zones and functions
  • Composure under pressure. Production failures don't wait. You're energized by urgent, ambiguous problems and take ownership until they're resolved
  • 3+ years of experience in hardware debug, failure analysis, or systems engineering in a server, datacenter, or semiconductor environment

Nice to have:

  • Rack-scale infrastructure (cooling systems, power distribution, rack managers)
  • High-speed interfaces (PCIe, Ethernet, SerDes) and their common failure modes
  • ATE or production test environments
  • Experience with Datacenters, GPUs, FPGAs, or custom ASICs
What we offer:
  • Competitive compensation packages, including generous equity packages
  • Comprehensive insurance coverage and other top-of-market benefits

Additional Information:

Job Posted:
February 18, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Failure Analysis Engineer

Loads Process Engineer

Archer is an aerospace company based in San Jose, California building an all-ele...
Location
Location
United States , San Jose
Salary
Salary:
Not provided
archer.com Logo
Archer Aviation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BS / MS in Aerospace Engineering or a related field
  • 5+ years of experience developing processes for loads analysis and downselection for vehicles using multibody dynamics solvers
  • Working knowledge of structural analysis and test, including a fundamental understanding of strength, stability and fatigue failure mechanisms
  • Proficiency in Python and/or Matlab scripting
  • Familiarity with software development best practices and version control
  • Strong technical, written, and verbal communication skills
  • Ability to work in groups and individually
  • Experience in a fast-paced design environment
Job Responsibility
Job Responsibility
  • Own the development, documentation, and improvement of the loads development process
  • Collaborate with teams that interface with loads (aerodynamics, airframe, engine, mass properties, mechanical systems, propeller, etc.)
  • Responsible for providing static and fatigue loads that are directly consumable by all downstream users (airframe, engine, mechanical systems, propeller, etc.)
  • Responsible for providing clear documentation of loads assumptions and methods to generate traceable and repeatable analyses
  • Support wind tunnel and flight test planning and execution in the areas of propeller loads, aeroelasticity, and aeromechanics, including test plan development, instrumentation and data acquisition definition, test monitoring and execution, data analysis and interpretation, and test report writing
  • Participate in loads means and methods of compliance discussions with the FAA
  • Support preparation of certification test plans
  • Support ground vibration testing
  • Fulltime
Read More
Arrow Right

Reliability Engineer

Founded in 1985, ATS is a company with a presence in the United States, Mexico a...
Location
Location
United States , Tupelo, Mississippi
Salary
Salary:
Not provided
atpchemical.com Logo
Advanced Technology Products
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in engineering (ABET accredited) or equivalent experience (ex. heavy industrial maintenance, reliability, or operations experience)
  • Minimum of one year of reliability experience
  • Demonstrates ability to use reliability tool sets
  • Experience in Performance of RCA
  • Involvement with RCM & FMEA
  • Master Level Proficiency in Predictive Technology
  • Vibration I Certification
  • Machine Health Monitoring Intermediate Proficiency
  • Experience with Work Execution Management
  • Technical understanding of electrical or mechanical components, tools, and designs
Job Responsibility
Job Responsibility
  • Promotes and adheres to the ATS safety culture
  • Ensures compliance with regulatory requirements and ATS policies and procedures
  • Partners with internal/external customer for engineered solutions to improve reliability and throughput
  • Identifies opportunities for Capital Expenditures for equipment replacement (develops and communicates ROI)
  • Highly knowledgeable in operating systems, critical elements, and best practices to enable a precision reliability culture
  • Knowledgeable application of common precision tools and practices
  • Partners with peers to perform reliability centered maintenance and deliverables (equipment specific maintenance plan -ESMP)
  • Actively collaborates with maintenance team on the use of predictive, preventative, and precision maintenance technologies and strategies designed to identify or control risks prior to failure and ensure optimum maintenance execution
  • Partners with peers to perform failure mode & effects analysis
  • Understands Work Execution Management (WEM) & improvements identified through reliability strategy session performance
  • Fulltime
Read More
Arrow Right

Hardware Engineering Manager

The HW Sustaining Engineering group in HPE Networking Products and Advanced Tech...
Location
Location
United States , San Jose
Salary
Salary:
130500.00 - 300000.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Electrical Engineering, Computer Science or equivalent
  • Experience in high-speed digital hardware design, test and de-bug
  • 3+ years of experience leading a hardware engineering team in a design, debug or quality role
  • Team leadership experience includes coaching, team building and supporting career growth of team members
  • Expertise in hardware quality, failure analysis, root-cause/corrective action (RCCA)
  • Familiarity with networking hardware components, interfaces, and systems
  • Ability to review system level test plans and test results and distill down to hardware specifics
  • Demonstrated success in managing multiple concurrent programs and priorities and delivering results on schedule
  • Excellent verbal and written communication skills and experience communicating with executive leadership and major customers
  • Skilled at working across organizational and geographical boundaries to drive a common goal
Job Responsibility
Job Responsibility
  • Lead a team of experienced HW engineers responsible for supporting HPE networking hardware products
  • Engage with customers and support teams to review, investigate and resolve customer escalations related to HPE networking hardware
  • Manage multiple concurrent failure analysis and design change projects
  • Manage priorities, allocate resources, track progress and drive to closure on schedule
  • Author RCCA (Root cause / Corrective Action) and other hardware quality related presentations and present to customers and Juniper stakeholders
  • Drive proactive quality improvement in new products through closed loop corrective actions and feedback into design teams
  • Collaborate with Supply Chain and Component Engineering to address component EOL (end-of-life) replacement, second source, and value engineering priorities
  • Collaborate with Manufacturing, CM (Contract Manufacturing) and ODM teams to resolve critical issues seen in manufacturing and ensure production hardware quality
  • Track and improve team performance
  • Coach and develop team members through regular engagement, one-on-one's and OKRs
What we offer
What we offer
  • Comprehensive suite of benefits that supports their physical, financial and emotional wellbeing
  • Programs catered to career development
  • Inclusive work environment valuing diverse backgrounds
  • Fulltime
Read More
Arrow Right

Field Service Reliability Engineer

Founded in 1985, ATS is a company with a presence in the United States, Mexico a...
Location
Location
United States , Hammond
Salary
Salary:
Not provided
atpchemical.com Logo
Advanced Technology Products
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in engineering (ABET accredited)
  • Eight or more years of reliability experience across 2 or more manufacturing sites
  • Demonstrates ability to perform full array of reliability tool sets
  • Strong technical understanding of electrical or mechanical components, tools, and designs
  • Ability to complete a failure mode effects analysis, cause and effect diagrams, root cause failure analysis, life-cycle costing, and risk analysis
  • Ability to research and apply new equipment technology / trends
  • Robust problem solving, mathematical, analytical, and decision making skills
  • Proficiency with computers, maintenance systems, and applications, including Microsoft Office
  • Excellent verbal communication, facilitation, and presentation skills
  • Strong reporting and technical writing capability
Job Responsibility
Job Responsibility
  • Extensive travel required. (Local, National, International)
  • Promotes and adheres to the ATS safety culture
  • Engages in various work environments and industries to lead reliability centered maintenance efforts
  • Mentors, coaches, and provides reliability best practices for applications in customer facilities, by customer personnel
  • Identifies top potential issues leading to lost production and preventable maintenance spending. Communicates findings with leadership
  • Provides solutions to root cause deficiencies and demonstrates economic benefits to their correction
  • Actively drives the implementation of equipment improvement projects
  • Identifies and implements current and new processes / technologies to increase equipment performance and uptime
  • Champions systems and best practice procedures towards a proactive manufacturing culture
  • Analyzes equipment performance, failure data, and corrective maintenance history to develop and deploy engineering solutions, improved maintenance strategies, preventative maintenance optimization, and other reliability techniques
  • Fulltime
Read More
Arrow Right

Field Reliability Services Engineer

Founded in 1985, ATS is a company with a presence in the United States, Mexico a...
Location
Location
United States , Greenville
Salary
Salary:
Not provided
atpchemical.com Logo
Advanced Technology Products
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in engineering (ABET accredited)
  • Eight or more years of reliability experience across 2 or more manufacturing sites
  • Demonstrates ability to perform full array of reliability tool sets
  • Strong technical understanding of electrical or mechanical components, tools, and designs
  • Ability to complete a failure mode effects analysis, cause and effect diagrams, root cause failure analysis, life-cycle costing, and risk analysis
  • Ability to research and apply new equipment technology / trends
  • Robust problem solving, mathematical, analytical, and decision making skills
  • Proficiency with computers, maintenance systems, and applications, including Microsoft Office
  • Excellent verbal communication, facilitation, and presentation skills
  • Strong reporting and technical writing capability
Job Responsibility
Job Responsibility
  • Extensive travel required. (Local, National, International)
  • Promotes and adheres to the ATS safety culture
  • Engages in various work environments and industries to lead reliability centered maintenance efforts
  • Mentors, coaches, and provides reliability best practices for applications in customer facilities, by customer personnel
  • Identifies top potential issues leading to lost production and preventable maintenance spending. Communicates findings with leadership
  • Provides solutions to root cause deficiencies and demonstrates economic benefits to their correction
  • Actively drives the implementation of equipment improvement projects
  • Identifies and implements current and new processes / technologies to increase equipment performance and uptime
  • Champions systems and best practice procedures towards a proactive manufacturing culture
  • Analyzes equipment performance, failure data, and corrective maintenance history to develop and deploy engineering solutions, improved maintenance strategies, preventative maintenance optimization, and other reliability techniques
  • Fulltime
Read More
Arrow Right

Field Service Reliability Engineer

Founded in 1985, ATS is a company with a presence in the United States, Mexico a...
Location
Location
United States , Hammond, Indiana
Salary
Salary:
Not provided
atpchemical.com Logo
Advanced Technology Products
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in engineering (ABET accredited)
  • Eight or more years of reliability experience across 2 or more manufacturing sites
  • Demonstrates ability to perform full array of reliability tool sets
  • Strong technical understanding of electrical or mechanical components, tools, and designs
  • Ability to complete a failure mode effects analysis, cause and effect diagrams, root cause failure analysis, life-cycle costing, and risk analysis
  • Ability to research and apply new equipment technology / trends
  • Robust problem solving, mathematical, analytical, and decision making skills
  • Proficiency with computers, maintenance systems, and applications, including Microsoft Office
  • Excellent verbal communication, facilitation, and presentation skills
  • Strong reporting and technical writing capability
Job Responsibility
Job Responsibility
  • Extensive travel required. (Local, National, International)
  • Promotes and adheres to the ATS safety culture
  • Engages in various work environments and industries to lead reliability centered maintenance efforts
  • Mentors, coaches, and provides reliability best practices for applications in customer facilities, by customer personnel
  • Identifies top potential issues leading to lost production and preventable maintenance spending. Communicates findings with leadership
  • Provides solutions to root cause deficiencies and demonstrates economic benefits to their correction
  • Actively drives the implementation of equipment improvement projects
  • Identifies and implements current and new processes / technologies to increase equipment performance and uptime
  • Champions systems and best practice procedures towards a proactive manufacturing culture
  • Analyzes equipment performance, failure data, and corrective maintenance history to develop and deploy engineering solutions, improved maintenance strategies, preventative maintenance optimization, and other reliability techniques
  • Fulltime
Read More
Arrow Right

Field Service Reliability Engineer

Founded in 1985, ATS is a company with a presence in the United States, Mexico a...
Location
Location
United States , Chicago, Illinois
Salary
Salary:
50.96 - 65.19 USD / Hour
atpchemical.com Logo
Advanced Technology Products
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in engineering (ABET accredited)
  • Eight or more years of reliability experience across 2 or more manufacturing sites
  • Demonstrates ability to perform full array of reliability tool sets
  • Strong technical understanding of electrical or mechanical components, tools, and designs
  • Ability to complete a failure mode effects analysis, cause and effect diagrams, root cause failure analysis, life-cycle costing, and risk analysis
  • Ability to research and apply new equipment technology / trends
  • Robust problem solving, mathematical, analytical, and decision making skills
  • Proficiency with computers, maintenance systems, and applications, including Microsoft Office
  • Excellent verbal communication, facilitation, and presentation skills
  • Strong reporting and technical writing capability
Job Responsibility
Job Responsibility
  • Promotes and adheres to the ATS safety culture
  • Engages in various work environments and industries to lead reliability centered maintenance efforts
  • Mentors, coaches, and provides reliability best practices for applications in customer facilities, by customer personnel
  • Identifies top potential issues leading to lost production and preventable maintenance spending. Communicates findings with leadership
  • Provides solutions to root cause deficiencies and demonstrates economic benefits to their correction
  • Actively drives the implementation of equipment improvement projects
  • Identifies and implements current and new processes / technologies to increase equipment performance and uptime
  • Champions systems and best practice procedures towards a proactive manufacturing culture
  • Analyzes equipment performance, failure data, and corrective maintenance history to develop and deploy engineering solutions, improved maintenance strategies, preventative maintenance optimization, and other reliability techniques
  • Provides technical service to operations and manufacturing personnel on equipment related troubleshooting efforts
  • Fulltime
Read More
Arrow Right

Field Reliability Services Engineer

Field Reliability Services Engineer role requiring 95% travel. Promotes safety, ...
Location
Location
United States , Greenville
Salary
Salary:
Not provided
atpchemical.com Logo
Advanced Technology Products
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in engineering (ABET accredited)
  • Eight or more years of reliability experience across 2 or more manufacturing sites
  • Demonstrates ability to perform full array of reliability tool sets
  • Strong technical understanding of electrical or mechanical components, tools, and designs
  • Ability to complete a failure mode effects analysis, cause and effect diagrams, root cause failure analysis, life-cycle costing, and risk analysis
  • Ability to research and apply new equipment technology / trends
  • Robust problem solving, mathematical, analytical, and decision making skills
  • Proficiency with computers, maintenance systems, and applications, including Microsoft Office
  • Excellent verbal communication, facilitation, and presentation skills
  • Strong reporting and technical writing capability
Job Responsibility
Job Responsibility
  • Extensive travel required. (Local, National, International)
  • Promotes and adheres to the ATS safety culture
  • Engages in various work environments and industries to lead reliability centered maintenance efforts
  • Mentors, coaches, and provides reliability best practices for applications in customer facilities, by customer personnel
  • Identifies top potential issues leading to lost production and preventable maintenance spending. Communicates findings with leadership
  • Provides solutions to root cause deficiencies and demonstrates economic benefits to their correction
  • Actively drives the implementation of equipment improvement projects
  • Identifies and implements current and new processes / technologies to increase equipment performance and uptime
  • Champions systems and best practice procedures towards a proactive manufacturing culture
  • Analyzes equipment performance, failure data, and corrective maintenance history to develop and deploy engineering solutions, improved maintenance strategies, preventative maintenance optimization, and other reliability techniques
  • Fulltime
Read More
Arrow Right