Accendo Reliability

Your Reliability Engineering Professional Development Site

  • Home
  • About
    • Contributors
    • About Us
    • Colophon
    • Survey
  • Reliability.fm
  • Articles
    • CRE Preparation Notes
    • NoMTBF
    • on Leadership & Career
      • Advanced Engineering Culture
      • ASQR&R
      • Engineering Leadership
      • Managing in the 2000s
      • Product Development and Process Improvement
    • on Maintenance Reliability
      • Aasan Asset Management
      • AI & Predictive Maintenance
      • Asset Management in the Mining Industry
      • CMMS and Maintenance Management
      • CMMS and Reliability
      • Conscious Asset
      • EAM & CMMS
      • Everyday RCM
      • History of Maintenance Management
      • Life Cycle Asset Management
      • Maintenance and Reliability
      • Maintenance Management
      • Plant Maintenance
      • Process Plant Reliability Engineering
      • RCM Blitz®
      • ReliabilityXperience
      • Rob’s Reliability Project
      • The Intelligent Transformer Blog
      • The People Side of Maintenance
      • The Reliability Mindset
    • on Product Reliability
      • Accelerated Reliability
      • Achieving the Benefits of Reliability
      • Apex Ridge
      • Field Reliability Data Analysis
      • Metals Engineering and Product Reliability
      • Musings on Reliability and Maintenance Topics
      • Product Validation
      • Reliability by Design
      • Reliability Competence
      • Reliability Engineering Insights
      • Reliability in Emerging Technology
      • Reliability Knowledge
    • on Risk & Safety
      • CERM® Risk Insights
      • Equipment Risk and Reliability in Downhole Applications
      • Operational Risk Process Safety
    • on Systems Thinking
      • Communicating with FINESSE
      • The RCA
    • on Tools & Techniques
      • Big Data & Analytics
      • Experimental Design for NPD
      • Innovative Thinking in Reliability and Durability
      • Inside and Beyond HALT
      • Inside FMEA
      • Institute of Quality & Reliability
      • Integral Concepts
      • Learning from Failures
      • Progress in Field Reliability?
      • R for Engineering
      • Reliability Engineering Using Python
      • Reliability Reflections
      • Statistical Methods for Failure-Time Data
      • Testing 1 2 3
      • The Manufacturing Academy
  • eBooks
  • Resources
    • Accendo Authors
    • FMEA Resources
    • Glossary
    • Feed Forward Publications
    • Openings
    • Books
    • Webinar Sources
    • Podcasts
  • Courses
    • Your Courses
    • Live Courses
      • Introduction to Reliability Engineering & Accelerated Testings Course Landing Page
      • Advanced Accelerated Testing Course Landing Page
    • Integral Concepts Courses
      • Reliability Analysis Methods Course Landing Page
      • Applied Reliability Analysis Course Landing Page
      • Statistics, Hypothesis Testing, & Regression Modeling Course Landing Page
      • Measurement System Assessment Course Landing Page
      • SPC & Process Capability Course Landing Page
      • Design of Experiments Course Landing Page
    • The Manufacturing Academy Courses
      • An Introduction to Reliability Engineering
      • Reliability Engineering Statistics
      • An Introduction to Quality Engineering
      • Quality Engineering Statistics
      • FMEA in Practice
      • Process Capability Analysis course
      • Root Cause Analysis and the 8D Corrective Action Process course
      • Return on Investment online course
    • Industrial Metallurgist Courses
    • FMEA courses Powered by The Luminous Group
    • Foundations of RCM online course
    • Reliability Engineering for Heavy Industry
    • How to be an Online Student
    • Quondam Courses
  • Calendar
    • Call for Papers Listing
    • Upcoming Webinars
    • Webinar Calendar
  • Login
    • Member Home
  • Barringer Process Reliability Introduction Course Landing Page
  • Upcoming Live Events
You are here: Home / Articles / Electronics Failure Analysis Done the Right Way

by Fred Schenkelberg 1 Comment

Electronics Failure Analysis Done the Right Way

Electronics Failure Analysis Done the Right Way

“Failures are gold”

Early in my career the engineering manager relished discovering equipment failure.

It didn’t matter if it was human, electronic, mechanical or software in nature, the glint in his eye soon gave way to a flood of possibilities. He enjoyed the process of investigating the fundamental reasons a failure occurred.

He wanted to know the cause(s) so we could design and operate a better system less likely to invoke those causes leading to failure. He was relentless in the pursuit of understanding what happened.

At times in my career, I’ve sent suspected bad parts or confirmed failed components to vendors for root cause analysis. That was often pure folly.

The responses often included:

“Yes, it failed” with no other explanation
“It’s working now, we do not see a failure here.”
“We know about this and it’s fixed already.”
“Oh, it was electrostatic discharge, it was your fault.” They had not received the part yet.
“You overstressed the part, it was a victim of poor design or handling.”

It was rare that a vendor would provide meaningful root cause analysis that provided actionable information for improvement. If it’s our design, in what way?

 

Failure Mechanisms and Modes

A failure mode is what is observed indicating a failure. It’s the plume of smoke rising from the back of the monitor. It’s the lack of response to the power switch. It’s the wrong answer appearing in your calculator. It’s how you know the product is not working.

A failure mechanism is the physical, chemical, thermodynamic or other processes that result in failure, which we then can observe as a failure mode.

Failure mechanisms are where the science exists, it’s what excited my boss. Understanding the science of failures allows us to select better materials, improve manufacturing processes, or minimize loads with the effect of improving reliability.

 

When a failure occurs do Root Cause analysis

The common term, “failure analysis” generally implies that we:

  • verify the failure occurred, try the power switch again
  • gather evidence exhibited indicating a failure, what are the symptoms or failure modes
  • recommend corrective or preventative action which may include short and long term solutions

This common approach tends to jump to action before understanding what actually happened. If you have a power supply that is not working and by replacing the analog controller component, the power supply works again, is it fair to conclude the replaced part was the root cause? No.

What about the replaced part failed? Was it faulty due to corrosion or mechanical bending? Was the part merely a victim of another element in the system causing a current overload that causes the part to fail?

Elements of a system fail for many different reasons and until you fully understand the root cause of the failure, any corrective or preventative action is little more than a guess at what will work.

Root Cause Analysis (RCA) include the detailed process of determining the failure mechanisms and the factors that cause that failure to occur.

RCA helps you understand if the faulty part is due to material contamination, poor material selection, over-stress, excessive degradation, etc. Understanding that the component is unable to withstand repeated inrush current cycles as each cycle accumulated damage to the aluminum lines on the IC, you can either change parts to one that can handle the stress or reduce the stress.

Simply replacing the part and saying the vendor is at fault for poor workmanship, is not going to solve the problem.

Failure analysis is largely done in the lab with the tools and expertise to tease out the root cause of a failure mode. Recently Bhanu Sood of NASA Goddard presented at the IPC APEX Expo on failure analysis, his slide set, Failure Analysis for Improved Reliability, is available online.

He provided an extensive overview of failure mechanisms and failure analysis techniques along with a couple case studies. It’s a very good read and certainly will spark your interest in celebrating failures.

Think of what you can learn and improve. I hope you get as excited by failures as my former boss.

Filed Under: Articles, Musings on Reliability and Maintenance Topics, on Product Reliability Tagged With: Failure analysis (FA)

About Fred Schenkelberg

I am the reliability expert at FMS Reliability, a reliability engineering and management consulting firm I founded in 2004. I left Hewlett Packard (HP)’s Reliability Team, where I helped create a culture of reliability across the corporation, to assist other organizations.

« Eliminating early life failures
General Human Factors Design Principles »

Comments

  1. Pavel Bochian says

    November 22, 2016 at 12:16 PM

    Good article.

    Reply

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Article by Fred Schenkelberg
in the Musings series

Join Accendo

Receive information and updates about articles and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

Recent Articles

  • Leadership Values in Maintenance and Operations
  • Today’s Gremlin – It’ll never work here
  • How a Mission Statement Drives Behavioral Change in Organizations
  • Gremlins today
  • The Power of Vision in Leadership and Organizational Success

© 2025 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy