Accendo Reliability

Your Reliability Engineering Professional Development Site

  • Home
  • About
    • Contributors
    • About Us
    • Colophon
    • Survey
  • Reliability.fm
    • Speaking Of Reliability
    • Rooted in Reliability: The Plant Performance Podcast
    • Quality during Design
    • CMMSradio
    • Way of the Quality Warrior
    • Critical Talks
    • Asset Performance
    • Dare to Know
    • Maintenance Disrupted
    • Metal Conversations
    • The Leadership Connection
    • Practical Reliability Podcast
    • Reliability Hero
    • Reliability Matters
    • Reliability it Matters
    • Maintenance Mavericks Podcast
    • Women in Maintenance
    • Accendo Reliability Webinar Series
  • Articles
    • CRE Preparation Notes
    • NoMTBF
    • on Leadership & Career
      • Advanced Engineering Culture
      • ASQR&R
      • Engineering Leadership
      • Managing in the 2000s
      • Product Development and Process Improvement
    • on Maintenance Reliability
      • Aasan Asset Management
      • AI & Predictive Maintenance
      • Asset Management in the Mining Industry
      • CMMS and Maintenance Management
      • CMMS and Reliability
      • Conscious Asset
      • EAM & CMMS
      • Everyday RCM
      • History of Maintenance Management
      • Life Cycle Asset Management
      • Maintenance and Reliability
      • Maintenance Management
      • Plant Maintenance
      • Process Plant Reliability Engineering
      • RCM Blitz®
      • ReliabilityXperience
      • Rob’s Reliability Project
      • The Intelligent Transformer Blog
      • The People Side of Maintenance
      • The Reliability Mindset
    • on Product Reliability
      • Accelerated Reliability
      • Achieving the Benefits of Reliability
      • Apex Ridge
      • Breaking Bad for Reliability
      • Field Reliability Data Analysis
      • Metals Engineering and Product Reliability
      • Musings on Reliability and Maintenance Topics
      • Product Validation
      • Reliability by Design
      • Reliability Competence
      • Reliability Engineering Insights
      • Reliability in Emerging Technology
      • Reliability Knowledge
    • on Risk & Safety
      • CERM® Risk Insights
      • Equipment Risk and Reliability in Downhole Applications
      • Operational Risk Process Safety
    • on Systems Thinking
      • The RCA
      • Communicating with FINESSE
    • on Tools & Techniques
      • Big Data & Analytics
      • Experimental Design for NPD
      • Innovative Thinking in Reliability and Durability
      • Inside and Beyond HALT
      • Inside FMEA
      • Institute of Quality & Reliability
      • Integral Concepts
      • Learning from Failures
      • Progress in Field Reliability?
      • R for Engineering
      • Reliability Engineering Using Python
      • Reliability Reflections
      • Statistical Methods for Failure-Time Data
      • Testing 1 2 3
      • The Hardware Product Develoment Lifecycle
      • The Manufacturing Academy
  • eBooks
  • Resources
    • Special Offers
    • Accendo Authors
    • FMEA Resources
    • Glossary
    • Feed Forward Publications
    • Openings
    • Books
    • Webinar Sources
    • Journals
    • Higher Education
    • Podcasts
  • Courses
    • Your Courses
    • 14 Ways to Acquire Reliability Engineering Knowledge
    • Live Courses
      • Introduction to Reliability Engineering & Accelerated Testings Course Landing Page
      • Advanced Accelerated Testing Course Landing Page
    • Integral Concepts Courses
      • Reliability Analysis Methods Course Landing Page
      • Applied Reliability Analysis Course Landing Page
      • Statistics, Hypothesis Testing, & Regression Modeling Course Landing Page
      • Measurement System Assessment Course Landing Page
      • SPC & Process Capability Course Landing Page
      • Design of Experiments Course Landing Page
    • The Manufacturing Academy Courses
      • An Introduction to Reliability Engineering
      • Reliability Engineering Statistics
      • An Introduction to Quality Engineering
      • Quality Engineering Statistics
      • FMEA in Practice
      • Process Capability Analysis course
      • Root Cause Analysis and the 8D Corrective Action Process course
      • Return on Investment online course
    • Industrial Metallurgist Courses
    • FMEA courses Powered by The Luminous Group
      • FMEA Introduction
      • AIAG & VDA FMEA Methodology
    • Barringer Process Reliability Introduction
      • Barringer Process Reliability Introduction Course Landing Page
    • Fault Tree Analysis (FTA)
    • Foundations of RCM online course
    • Reliability Engineering for Heavy Industry
    • How to be an Online Student
    • Quondam Courses
  • Webinars
    • Upcoming Live Events
    • Accendo Reliability Webinar Series
  • Calendar
    • Call for Papers Listing
    • Upcoming Webinars
    • Webinar Calendar
  • Login
    • Member Home
Home » Articles » Page 143

Articles

Find all articles across all article series listed in reverse chronological order.

by Fred Schenkelberg Leave a Comment

Creating a Dot Plot

Creating a Dot Plot

Graphs contain information and often tell a story. Our interpretation of the graphic can be aided or hindered by the design or style of the plot. Cleveland and McGill (1984) studied graphical perception and found the use of dot plots to aid viewers to understand the data’s message clearly.

The nature of a dot plot is like a bar chart, yet without the bars. Less ink, just a dot to indicate count or position along an axis permits conveying information simply. Due to its simplicity, it also permits adding additional information useful for comparisons or spotting trends, and more. [Read more…]

Filed Under: Articles, CRE Preparation Notes, Probability and Statistics for Reliability

by Fred Schenkelberg Leave a Comment

Dealing with Reliability Related Uncertainty

Dealing with Reliability Related Uncertainty

Uncertainty is another word for risk. Reliability uncertainty or risk is neither good nor bad, it just a bit unknown. Until we know the outcome, the eventual reliability performance, we will not know the impact.

So, how do we deal with reliability uncertainty? Will our product or system work as expected over time, or will it fail? Let’s examine a few of the common approaches in use and when and why the approach is effective. [Read more…]

Filed Under: Articles, Musings on Reliability and Maintenance Topics, on Product Reliability

by Robert (Bob) J. Latino Leave a Comment

Root Cause Analysis: The Justification Game

Root Cause Analysis: The Justification Game

In last month’s introductory article, we discussed some of the barriers to selling not only the concept of Root Cause Analysis (RCA) but also the recommendations generated as a result of these analyses. We also laid the framework for making better predictions by generating and accessing better data—namely predicting our Mean Time between Failure (MTBF) and Mean Time to Restore (MTTR) earlier and, therefore, implementing a fix faster.

Given this background, let’s explore how we can now justify conducting a RCA and implementing the recommendations as a result of the analysis. By and far, conducting a true RCA is viewed as luxury not a necessity. Think about the objections we hear when we offer the idea of gathering RCA teams. What follows is a list of common objections to RCA accompanied by rational justifications that any manager can employ. [Read more…]

Filed Under: Articles, on Maintenance Reliability, The RCA

by Doug Lehr Leave a Comment

The Importance of Post-delivery Feedback

The Importance of Post-delivery Feedback

In the article Reliability in Equipment Design it was stated that a project team can set reliability target(s) for new equipment using historical reliability data. One option OEMs have for capturing historical reliability data is a Failure Reporting And Corrective Action System, or FRACAS. The data captured can provide a comprehensive view of how the equipment is performing. This data can be used to improve legacy equipment and drive decision-making on new equipment. [Read more…]

Filed Under: Articles, Equipment Risk and Reliability in Downhole Applications, on Risk & Safety

by Christopher Jackson Leave a Comment

Do Degrees Hurt or Help?

Do Degrees Hurt or Help?

Remember how we were told as kids in school that we had to study hard to get into college or university? There was that ongoing, implied threat that not getting into university or college would ruin our career. Or at least, be very bad.

It turns out it may not be that bad after all. For example, over half of Apple’s employees don’t have 4-year degrees. Why? [Read more…]

Filed Under: Articles, on Product Reliability, Reliability in Emerging Technology

by James Kovacevic Leave a Comment

Failure Reporting, Analysis and Corrective Action System (FRACAS)

Using a System to Record, Report And Eliminate Defects

Why is that some organization seem to break the reactive cycle and others don’t?  After all most organizations have a PM program and some form of a planning and scheduling program right?   The key difference between those that do is their ability to use their failure data and systematically eliminate defects and issues from the processes and equipment.  This doesn’t mean adding a new PM everytime some fails, which just won’t work.

To eliminate the defects and issues, the organization needs to collect meaningful data to analyze and act on.  This is where FRACAS comes in. [Read more…]

Filed Under: Articles, Maintenance and Reliability, on Maintenance Reliability Tagged With: Failure Reporting Analysis and Corrective Action System (FRACAS)

by Greg Hutchins Leave a Comment

Where’s Our Next Shock Coming From?

Where’s Our Next Shock Coming From?

Guest Post by Geary Sikich (first posted on CERM ® RISK INSIGHTS – reposted here with permission)

The Coronavirus pandemic (COVID19) continues to spread throughout the world despite the best efforts of governments and the medical community.  With no vaccine becoming available in the near term and perhaps even in the long term we are faced with growing uncertainty.  It seems that ever since the emergence of COVID19 that conflicting advice, information and guidance has exacerbated the situation and created a lack of trust on the part of the public on what measures to take.  Wear a mask. 

Don’t wear a mask.  Social distance.  Don’t social distance.  The only advice that seems trustworthy is “Wash your hands”; and, that is something that we should do regardless of the pandemic – it’s good common sense. [Read more…]

Filed Under: Articles, CERM® Risk Insights, on Risk & Safety

by James Reyes-Picknell Leave a Comment

Maintenance Excellence – no quick fixes

Maintenance Excellence – no quick fixes
We all love instant gratification. Some of us can’t resist temptations, some give in some of the time, others prefer to defer for greater reward later. Never-the-less, regardless of our age, we have grown used to getting what we want, generally quickly and limited only by our ability and willingness to pay for it.

[Read more…]

Filed Under: Articles, Conscious Asset, on Maintenance Reliability

by Adam Bahret 2 Comments

HALT shouldn’t be “H.A.L.T.”

HALT shouldn’t be “H.A.L.T.”

The fist part of this post you likely already know.  It’s the second part that may be helpful.

I love HALT testing and almost always include it in a new program.  With a team new to the concept there is always the hurdle of getting them to understand it’s value.  It’s not intuitive to see value in destroying a product with stepped stresses.  Often these stresses aren’t even apart of the product’s use case. Why vibrate a lab electronic device that spends its entire life on a bench? Seeing the failure mode is a capacitor flying off the PCB at 50 G’s doesn’t reinforce the value of the activity without some explanation.

[Read more…]

Filed Under: Apex Ridge, Articles, on Product Reliability

by Fred Schenkelberg Leave a Comment

Building and Using Pareto Charts

Building and Using Pareto Charts

You may have heard of the 80/20 rule. The idea is that 80% of the wealth is held by 20% of the population. As an Italian economist, Vilfredo Pareto made this observation that became generalized as the

Pareto Principle: 80% of outcomes are due to 20% of causes

For field returns, for example, we may surmise that 80% of the failures are due to 20% of the components, for example. This principle helps us to focus our work to reduce field failures by address the vital few causes that lead to the most, or most expensive, failures. [Read more…]

Filed Under: Articles, CRE Preparation Notes, Data Collection and Use, Probability and Statistics for Reliability

by Nancy Regan Leave a Comment

Is Reliability Centered Maintenance (RCM) just for airplanes?

Is Reliability Centered Maintenance (RCM) just for airplanes?

Have you ever heard that Reliability Centered Maintenance (RCM) is just for airplanes? Don’t believe it! Here’s how RCM came to be one of the most effective (and universal) Reliability improvement efforts an organization can implement. [Read more…]

Filed Under: Articles, Everyday RCM, on Maintenance Reliability

by Christopher Jackson Leave a Comment

How Company Visions Make or Break Reliability

How Company Visions Make or Break Reliability

Every organization needs to be able to explain ‘why’ it is here. In fact, an organization’s ‘why’ is . Check out Simon Sinek. One of his favourite phrases is that ‘people don’t buy what you do – they buy why you do it.’

People buy Apple products because of they enjoy how they interface with devices that are part of a bigger and seamless ecosystem. All their competitors try to emulate this. Amazon is all about customer experience. [Read more…]

Filed Under: Articles, on Product Reliability, Reliability in Emerging Technology

by Robert (Bob) J. Latino Leave a Comment

Viewing a Hospital as a System: A Reliability Perspective

Viewing a Hospital as a System: A Reliability Perspective

Veteran professionals in the Reliability field view every business as a system. All systems have 1) inputs, 2) a transformation of those inputs in some form or fashion and 3) outputs. Just think about that for a minute; think about your schools, banks, manufacturing plants, small businesses…they are all systems. [Read more…]

Filed Under: Articles, on Maintenance Reliability, The RCA

by Fred Schenkelberg Leave a Comment

A Two-Step Approach to Get Better at What You Do

A Two-Step Approach to Get Better at What You Do

How is it that some people continue to get better at managing meetings, designing complex test plans, making presentations, or solving problems? How in general do people improve their performance over time at something? [Read more…]

Filed Under: Articles, Musings on Reliability and Maintenance Topics, on Product Reliability

by James Kovacevic Leave a Comment

Establishing the Frequency of Failure Finding Maintenance Inspections

Preventing The Consequences Of A Hidden Failure From Devastating Your Organization.

Ever wonder how some of the worst industrial disasters occur?  It is usually the result of multiple failures.  Failure of the primary system and failure of the protective systems.   Ensuring the protective system(s) are not in a failed state should be of utmost importance to any organization.  But how often should we test the protective systems to ensure the required availability?

Establishing the correct frequencies of the inspection/ testing activities of these protective system(s) is critical to not only the success but safety and reputation of any organization.   Too infrequently and the organization is at risk of a major incident.  Too frequently, and the organization is subjected to excess planned downtime, an increased probability of maintenance induced failures and increased maintenance cost.
This article will continue the discussion on establishing the correct inspection frequency in a maintenance program.  There are three different approached to use, based on the type of maintenance being performed;

  • Time-Based Maintenance
  • On-Condition Maintenance
  • Failure Finding Maintenance

This article will focus on Failure Finding Maintenance.

What Are Protective Systems, Hidden Failures and Failure Finding Maintenance

A protective system or device is a system or device which is designed to protect and mitigate or reduce the consequences of failure.  These consequences may be safety, environmental or operational in nature.   These devices or systems are designed to;

  • Alert – to potential problem conditions (i.e. alarm)
  • Relieve – prevent failure conditions causing greater problems (i.e. pressure relief valve)
  • Shutdown – stop a process to prevent greater problems from occurring (i.e. motor overload)
  • Mitigate – alleviate the consequences of a failure (i.e. fire suppression equipment)
  • Replace – continue to provide a function by an alternative means (i.e. back up pump)
  • Guard – prevent an accident from occurring  (i.e. E-Stop)

Knowing what a protective device or system is, you may see that if a pressure relief valve became corroded and seized in the closed position, it would not be evident to the operators.   This is a hidden failure.   A hidden failure can be defined as; a failure which may occur and not be evident to the operating crew under normal circumstances if it occurs on its own.  Obviously, this could lead to significant consequences if the tank that the pressure relief valve is protecting is overpressurized.   This is where failure finding maintenance comes in.

Failure-finding maintenance is a set of tasks designed to detect or predict failures in the protective systems or devices to reduce the likelihood of a failure in the protective system and the regular equipment from occurring at the same time.  So how to do you determine how often the protective systems should be checked for failure?  Establish the frequency using a formula.

Establishing Failure Finding Maintenance Frequencies Using Formulas

There is a single formula that will take into consideration of all variables to establish the failure finding interval (FFI);  FFI = (2 x MTIVE x MTED) /MMF

Where;

  • MTIVE = MTBF of the protective device or system
  • MTED = Mean Time Between Failure of the Protected Function
  • MMF =   Mean Time Between Multiple Failures

So if we use an example from RCM2, we can see how this works; The users of a pump and a standby pump want the following from the system.

  • The probability of a multiple failure to be less than 1 in 1000 in any one year (MMF)
  • The rate of unanticipated failures of the duty pump is 1 in 10 years (MTED)
  • The rate of unanticipated failure of the standby pump is 1 in 8 years (MTIVE)

Therefore the correct failure finding interval would be;

  • FFI = (2 x 8 x 10) / 1000
  • FFI = (160)/1000
  • FFI = 0.16 years
  • 0.16 years x 12 months = 2 months

This indicates that the standby pump must be checked every two months to verify it is fully operational.   If this check is not performed, the likelihood of a multiple failures increases.

Lastly, if the failure of the protective device can be caused by the failure finding task itself, there is another approach to be used, which is beyond the scope of this article.

Do you have a program in place to check your protective systems?  If not, are you aware of the risk that your organization is exposed to?   Take the time to determine your protective systems and establish your failure finding tasks.

Remember, to find success; you must first solve the problem, then achieve the implementation of the solution, and finally sustain winning results.

I’m James Kovacevic
Eruditio, LLC
Where Education Meets Application
Follow @EruditioLLC

References;

  • RCM2 by John Moubray
  • Fixed Time Maintenance
  • On-Condition Maintenance

 

Filed Under: Articles, Maintenance and Reliability, on Maintenance Reliability

  • « Previous Page
  • 1
  • …
  • 141
  • 142
  • 143
  • 144
  • 145
  • …
  • 278
  • Next Page »
[show_to accesslevel="Free" ] Thanks for being a member [member_first_name]! [/show_to][hide_from visible_to='public']

Join Accendo

Receive information and updates about articles and many other resources offered by Accendo Reliability by becoming a member. It’s free and only takes a minute.

Join Today

[/hide_from]

Recent Articles

  • Evaluating Risk: When to Apply a Risk Matrix or Monte Carlo Analysis
  • Illuminating MTBF’s Lack of Information
  • Return parts analysis – why?
  • How to Make ISO 55000 and ISO 55001 Successful 
  • Killer AI and Risk Based, Decision Making

© 2026 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy

Book the Course with John
  Ask a question or send along a comment. Please login to view and use the contact form.
This site uses cookies to give you a better experience, analyze site traffic, and gain insight to products or offers that may interest you. By continuing, you consent to the use of cookies. Learn how we use cookies, how they work, and how to set your browser preferences by reading our Cookies Policy.