Accendo Reliability

Your Reliability Engineering Professional Development Site

  • Home
  • About
    • Contributors
    • About Us
    • Colophon
    • Survey
  • Reliability.fm
  • Articles
    • CRE Preparation Notes
    • NoMTBF
    • on Leadership & Career
      • Advanced Engineering Culture
      • ASQR&R
      • Engineering Leadership
      • Managing in the 2000s
      • Product Development and Process Improvement
    • on Maintenance Reliability
      • Aasan Asset Management
      • AI & Predictive Maintenance
      • Asset Management in the Mining Industry
      • CMMS and Maintenance Management
      • CMMS and Reliability
      • Conscious Asset
      • EAM & CMMS
      • Everyday RCM
      • History of Maintenance Management
      • Life Cycle Asset Management
      • Maintenance and Reliability
      • Maintenance Management
      • Plant Maintenance
      • Process Plant Reliability Engineering
      • RCM Blitz®
      • ReliabilityXperience
      • Rob’s Reliability Project
      • The Intelligent Transformer Blog
      • The People Side of Maintenance
      • The Reliability Mindset
    • on Product Reliability
      • Accelerated Reliability
      • Achieving the Benefits of Reliability
      • Apex Ridge
      • Field Reliability Data Analysis
      • Metals Engineering and Product Reliability
      • Musings on Reliability and Maintenance Topics
      • Product Validation
      • Reliability by Design
      • Reliability Competence
      • Reliability Engineering Insights
      • Reliability in Emerging Technology
      • Reliability Knowledge
    • on Risk & Safety
      • CERM® Risk Insights
      • Equipment Risk and Reliability in Downhole Applications
      • Operational Risk Process Safety
    • on Systems Thinking
      • Communicating with FINESSE
      • The RCA
    • on Tools & Techniques
      • Big Data & Analytics
      • Experimental Design for NPD
      • Innovative Thinking in Reliability and Durability
      • Inside and Beyond HALT
      • Inside FMEA
      • Institute of Quality & Reliability
      • Integral Concepts
      • Learning from Failures
      • Progress in Field Reliability?
      • R for Engineering
      • Reliability Engineering Using Python
      • Reliability Reflections
      • Statistical Methods for Failure-Time Data
      • Testing 1 2 3
      • The Manufacturing Academy
  • eBooks
  • Resources
    • Accendo Authors
    • FMEA Resources
    • Glossary
    • Feed Forward Publications
    • Openings
    • Books
    • Webinar Sources
    • Podcasts
  • Courses
    • Your Courses
    • Live Courses
      • Introduction to Reliability Engineering & Accelerated Testings Course Landing Page
      • Advanced Accelerated Testing Course Landing Page
    • Integral Concepts Courses
      • Reliability Analysis Methods Course Landing Page
      • Applied Reliability Analysis Course Landing Page
      • Statistics, Hypothesis Testing, & Regression Modeling Course Landing Page
      • Measurement System Assessment Course Landing Page
      • SPC & Process Capability Course Landing Page
      • Design of Experiments Course Landing Page
    • The Manufacturing Academy Courses
      • An Introduction to Reliability Engineering
      • Reliability Engineering Statistics
      • An Introduction to Quality Engineering
      • Quality Engineering Statistics
      • FMEA in Practice
      • Process Capability Analysis course
      • Root Cause Analysis and the 8D Corrective Action Process course
      • Return on Investment online course
    • Industrial Metallurgist Courses
    • FMEA courses Powered by The Luminous Group
    • Foundations of RCM online course
    • Reliability Engineering for Heavy Industry
    • How to be an Online Student
    • Quondam Courses
  • Calendar
    • Call for Papers Listing
    • Upcoming Webinars
    • Webinar Calendar
  • Login
    • Member Home
  • Barringer Process Reliability Introduction Course Landing Page
  • Upcoming Live Events
You are here: Home / Articles / Maintenance Tactics: A Practical Guide to Balancing Cost, Reliability, and Risk

by Miguel Pengel Leave a Comment

Maintenance Tactics: A Practical Guide to Balancing Cost, Reliability, and Risk

Maintenance Tactics: A Practical Guide to Balancing Cost, Reliability, and Risk

In most industries we ideally want the lowest-cost maintenance strategy, one that balances risk of unplanned vs planned maintenance costs to a lifetime minimum.

This is what we’ve calculated in a previous article HERE, and this article/exercise is a follow-on from this.

The downside of using this method is that it may select a maintenance interval where the inherent reliability of the component at changeout time is less than is required for operational reasons, or that we’re not getting the best “increase in reliability vs marginal cost” value.

This brings us to three types of maintenance intervals:

–            Optimal Interval for the lowest lifecycle cost

–            Optimal Interval for the best reliability vs marginal cost trade-off

–            Interval set for the required component reliability

Read more: Maintenance Tactics: A Practical Guide to Balancing Cost, Reliability, and Risk

I this article, we will be covering these three main types cost vs risk maintenance strategies, how they are quantified, and when to use them. We will delve through some practical exercises to show how the model and calculations fit together.

During this exercise we will also put dollar figures to maintenance strategy choices, as in reality, this is the simplest and most effective way of communicating cost-benefit to management.

The worked Excel sheet can be downloaded after filling in the form at the end of the article, and is recommended to do so before working through this article for a better understanding.

Optimal Interval for the lowest lifecycle cost:

This strategy is usually most Reliability Engineer’s their go-to. If achieving the lowest maintenance cost of a component, while balancing the cost of downtime is the goal, then this is the method to use.

Here we calculate the cost of planned maintenance, as well as the cost of corrective (unplanned) maintenance and calculate the combined cost for each successive maintenance interval.

As the maintenance interval increases, the risk of failing the component also does (if Beta >1), and this factor is included in the cost calculation.  

This cost associated with failure risk is derived from the results in the Weibull Analysis done HERE, and the Beta and Eta values have been carried over into this exercise to help with the calculations.

When equalizing this into a cost/hr, we will get a minimum, which is the maintenance interval at which we will achieve the lowest cost over the life of the part. In our case, this is at 3,500 hours.

Because cost/hr is sometimes not as clear as just one simple value, we can use the below method to calculate the total cost of each interval, and subsequently, the cost difference between the “lowest-cost” interval, and any other possibly interval.

This is done by multiplying the Cost/Hr of each respective interval by the interval value itself to get the “Total Cost” and subsequently proportioning it by the amount this interval fits into the Lowest Cost Interval of 3,500 hours. This is the “Cost during Life” column in the Excel sheet. When calculating the “Cost during Life”, you need to use the Optimal cost interval as the expected life. This is because we will assume you will adopt this interval, and all your cost-benefit comparisons will thus be based off this number.

After this we simply subtract each maintenance intervals Cost during Life by the one at 3,500 hours to get the total deviation. This plot is shown below and has the same shape as the cost/hr chart covered above

The problem with this method is that the optimal interval in terms of cost might be associated with a reliability value that is unacceptable. In our example, the optimal interval of 3,500 hours equates to a reliability of only 82%. Now this may be OK depending on your circumstances, however keep the following points in mind:

  • Reliability of 82% for a component in a parallel circuit where it has redundancy will still ensure the “system” function has a higher reliability. So its important to consider if this component is in series, or has redundancy.
  • If the component is in series, 82% might not be enough. Examples include safety critical systems such as airbags, ejection seat cartridges, isolators. These sort of systems require a high level of reliability that the lowest cost maintenance/ replacement interval might not satisfy
  • Mining production targets and budgets are usually built around availability targets of the equipment. This is because it is important that the machine is available to extract commodities for a certain percentage of available operations time. No use saving a few thousand dollars in  maintenance if it results in millions in lost production.

Optimal Interval for the best reliability vs marginal cost trade-off

This interval is the best “bang-for-buck”, and is not the lowest cost, or the highest reliability, but balances the trade-off between the two. Before we delve into calculating this one, let’s conceptually understand why this exists using our excel example as a reference.

In the previous calculation, we got the lowest cost maintenance interval. But let’s have a closer look at the excess cost of deviating from this interval, and the associated reliability.

If I dropped my maintenance/ replacement interval from 3500 hours to 1000 hours, I would have a cost deviation of $76,300, with a reliability increase from 82% to 98 %.

If I were to go from 1000 hours to 500 hours, the costs exponentially shoots up to $218,000 and reliability only goes up to 99%. So, an almost double increase in cost for only 1% increase in reliability.

What’s happening here is that because we have a Beta value of more than 1, and the part is experiencing wear-out failure, and the failure risk cost increases exponentially with time.

Thus, for every decrease in maintenance interval, we will experience diminishing returns when comparing the increase in component reliability with increase in maintenance costs from this reduced interval.

There is of course an optimal trade-off here, and that value is when the value of “failure risk saving” minus “excess cost of maintenance” is highest. What this means is that in order to have the best reliability vs cost trade off, the cost of our increased reliability (that is, the more frequent maintenance intervals) should always be less than the marginal savings associated with the reduced failure risk.

Graphically, this is shown in the figure below in the etched area of cross-over when plotting excess cost from optimal vs failure risk cost saving.

The absolute optimum is where this value is maximized, and is shown in the figure below, and sits at about 2250 Hours.

Interval set for the required component reliability

This is the simplest interval, and is used when we must achieve a minimum reliability. This is mainly used for safety critical components.

We simply use the derived Reliability Function that we got out of our Weibull Analysis, select the target reliability and locate the corresponding Hours. This is your minimum interval, and probably isn’t going to be the lowest cost, or the best reliability vs cost trade-off, but that’s all that you have to work off in this sort of situation.

Conclusion

As a Reliability Engineer, remember that your role is to “Engineer”. This means developing the most optimal solution dependent on the constraints set before you. As we’ve seen, sometimes having the lowest cost maintenance interval can’t be the answer, and we must adapt our maintenance tactics to balance cost, availability and risk.

The worked Excel example can be downloaded at the bottom of the original article HERE.

Filed Under: Articles, Asset Management in the Mining Industry, on Maintenance Reliability

About Miguel Pengel

Miguel Pengel is a Registered Professional Engineer based in Australia, with experience in a wide variety of heavy industries such as Mining, Smelting and Power Generation.

He holds a degree in Mechanical & Aerospace Engineering, as well as a Masters in Applied Finance from the University of Queensland.

Miguel has worked in Reliability, Mechanical, Production and Project engineering roles for some of the world's largest companies such as Glencore, Hitachi and Rio Tinto.

« Remember PROACT V1
What Kinds of Assets can RCM be Applied to? »

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Asset Management in the Mining Industry logo Photo of Miguel PengelArticles by Miguel Pengel
in the Asset Management in the Mining Industry: A Blueprint for Operational Excellence article series

Join Accendo

Receive information and updates about articles and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

Recent Posts

  • Today’s Gremlin – It’ll never work here
  • How a Mission Statement Drives Behavioral Change in Organizations
  • Gremlins today
  • The Power of Vision in Leadership and Organizational Success
  • 3 Types of MTBF Stories

© 2025 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy