Accendo Reliability

Your Reliability Engineering Professional Development Site

  • Home
  • About
    • Contributors
    • About Us
    • Colophon
    • Survey
  • Reliability.fm
  • Articles
    • CRE Preparation Notes
    • NoMTBF
    • on Leadership & Career
      • Advanced Engineering Culture
      • ASQR&R
      • Engineering Leadership
      • Managing in the 2000s
      • Product Development and Process Improvement
    • on Maintenance Reliability
      • Aasan Asset Management
      • AI & Predictive Maintenance
      • Asset Management in the Mining Industry
      • CMMS and Maintenance Management
      • CMMS and Reliability
      • Conscious Asset
      • EAM & CMMS
      • Everyday RCM
      • History of Maintenance Management
      • Life Cycle Asset Management
      • Maintenance and Reliability
      • Maintenance Management
      • Plant Maintenance
      • Process Plant Reliability Engineering
      • RCM Blitz®
      • ReliabilityXperience
      • Rob’s Reliability Project
      • The Intelligent Transformer Blog
      • The People Side of Maintenance
      • The Reliability Mindset
    • on Product Reliability
      • Accelerated Reliability
      • Achieving the Benefits of Reliability
      • Apex Ridge
      • Field Reliability Data Analysis
      • Metals Engineering and Product Reliability
      • Musings on Reliability and Maintenance Topics
      • Product Validation
      • Reliability by Design
      • Reliability Competence
      • Reliability Engineering Insights
      • Reliability in Emerging Technology
      • Reliability Knowledge
    • on Risk & Safety
      • CERM® Risk Insights
      • Equipment Risk and Reliability in Downhole Applications
      • Operational Risk Process Safety
    • on Systems Thinking
      • Communicating with FINESSE
      • The RCA
    • on Tools & Techniques
      • Big Data & Analytics
      • Experimental Design for NPD
      • Innovative Thinking in Reliability and Durability
      • Inside and Beyond HALT
      • Inside FMEA
      • Institute of Quality & Reliability
      • Integral Concepts
      • Learning from Failures
      • Progress in Field Reliability?
      • R for Engineering
      • Reliability Engineering Using Python
      • Reliability Reflections
      • Statistical Methods for Failure-Time Data
      • Testing 1 2 3
      • The Manufacturing Academy
  • eBooks
  • Resources
    • Accendo Authors
    • FMEA Resources
    • Glossary
    • Feed Forward Publications
    • Openings
    • Books
    • Webinar Sources
    • Podcasts
  • Courses
    • Your Courses
    • Live Courses
      • Introduction to Reliability Engineering & Accelerated Testings Course Landing Page
      • Advanced Accelerated Testing Course Landing Page
    • Integral Concepts Courses
      • Reliability Analysis Methods Course Landing Page
      • Applied Reliability Analysis Course Landing Page
      • Statistics, Hypothesis Testing, & Regression Modeling Course Landing Page
      • Measurement System Assessment Course Landing Page
      • SPC & Process Capability Course Landing Page
      • Design of Experiments Course Landing Page
    • The Manufacturing Academy Courses
      • An Introduction to Reliability Engineering
      • Reliability Engineering Statistics
      • An Introduction to Quality Engineering
      • Quality Engineering Statistics
      • FMEA in Practice
      • Process Capability Analysis course
      • Root Cause Analysis and the 8D Corrective Action Process course
      • Return on Investment online course
    • Industrial Metallurgist Courses
    • FMEA courses Powered by The Luminous Group
    • Foundations of RCM online course
    • Reliability Engineering for Heavy Industry
    • How to be an Online Student
    • Quondam Courses
  • Calendar
    • Call for Papers Listing
    • Upcoming Webinars
    • Webinar Calendar
  • Login
    • Member Home
  • Barringer Process Reliability Introduction Course Landing Page
  • Upcoming Live Events
You are here: Home / Articles / Do the Best You Can With Available Data?

by Larry George Leave a Comment

Do the Best You Can With Available Data?

Do the Best You Can With Available Data?

Lifetime data is nice to have, but lifetime data is not necessary! Generally Accepted Accounting Principles require statistically sufficient data to estimate nonparametric reliability and failure rate functions. Some work is required!

ISO 14224 “Petroleum, Petrochemical and Natural Gas Industries—Collection and Exchange of Reliability and Maintenance Data for Equipment” requires lifetime data to estimate exponential or Weibull reliability functions! Sales or ships and returns or failure counts are statistically sufficient to make nonparametric estimates of reliability and failure rate functions, without unwarranted distribution assumptions or lifetime data!

LILCO 1911-1998: Long Island Lighting

I was in the maintenance manager’s office when someone interrupted; a disgruntled employee was making threats. Manager disappeared. Later the manager returned. I explained how to estimate reliability from installed base and failure data required by GAAP (Generally Accepted Accounting Principles). Manager said, “We don’t even have records of our equipment, even purchase records!”

CMMS Software Implements Standards

At a Houston reliability meeting, 1977, oil company consultants bragged about their yachts. Has there been progress since then?

“…the Achilles heel of these technologies (AI and ML) is poor data quality. By some estimates, at best 3% of historical equipment performance data meet basic quality standards.” [Ciliberti]

“Depending on the sector in which your business operates, you may be subject to regulatory compliance standards that you must adhere to.” “Inspections and audits assess the business’s adherence to the different standards.” “CMMS Software can help a business meet regulatory compliance standards more painlessly.” [Morsillo]

Field reliability data led to selling failure rate estimates, which led to writing standards for data collection, which led to auditing to assure compliance with standards, which led to sales of CMMS software to implement standards. For example: OREDA (Offshore REliability Data) started ISO 14224 to get more data so it could publish more failure rates in the OREDA Handbook [Gjerstad] and now in the “cloud” https://oreda.com/. This is known as “Materialist Capture” or “Financial Capture”.

ISO 14224 Attempts to be Constructive?

“As of today, monitoring and follow-up of technical status of SIS components require considerable manual effort by extracting and interpreting maintenance and failure data from various systems and sources.” [Hauge et al.]

Ali Bashir says, “Results from reliability analytics and usage of codes are often not communicated back to the shop floor. It is important for people to realise that the codes that are used and the information put into CMMS is referenced to make data driven decisions.”

In “ISO 14224-Implementation in 6 Easy Steps,” Roy Milne says, “…how reliability will consume the data, analyze it, validate it and provide recommendations.” 

ISO 14224 requires reliability lifetime data, which makes CMMS database programming easy; it also recommends estimation of constant failure rates or Weibull reliability functions. ISO 14224 avoids ISO 31000 risk, uncertainty, and PRA (Probabilistic Risk Analysis); it focuses on reliability as probability based on field data [Terje Aven]. ISO 14224:2016 introduced “Uncertainty” [Selvik et al. quibble].

What is My Problem with ISO 14224? 

ISO 14224 requires reliability lifetime data. That data is sufficient but not necessary to make nonparametric reliability estimates. Lifetime data costs a lot, requires a lot of work to track every part by name and serial number, and incurs errors and missing data. 

ISO 14224 recommends constant failure rate or Weibull estimation. Nonparametric reliability estimation, without life data, has been done for 35 years. Data required by GAAP is free, contains lifetime information, is not supposed to have accounting errors, and is population data!

ISO 14224 recommends baseball reliability statistics: Paretos, RCA (Root-Cause Analysis), RGA, MCF, and Markov or Simulation. 

  • Paretos are OK statistics. RCA is OK too. Assuming exponential or Weibull reliability requires justification. Lifetime data is unnecessary for exponential, Weibull or nonparametric reliability estimation. ISO 14224 requires data sufficient for nonparametric reliability estimation but never mentions nonparametric statistics, multivariate statistics, or dependence.
  • RGA (Reliability Growth Analysis a la AMSAA, Larry Crow) is really MTBF growth analysis. Want reliability function growth analysis [https://fred-schenkelberg-project.prev01.rmkr.net/statistical-reliability-control/#more-522710/]? 
  • MCF (Mean Cumulative Function) is for renewal processes. MCF gives averages only, [Nelson, https://fred-schenkelberg-project.prev01.rmkr.net/predicting-repair-rates-plots/#more-289657/] not reliability functions [George 2004].
  • Markov analysis requires transition rates. Markov transition rates are nonparametric, actuarial failure or repair rates.
  • Simulation depends on correctness of distribution assumptions and dependence (copulas).

What People Say About ISO 14224?

“…ISO 14224 was developed for the Petroleum, Natural Gas and Petrochemical industries, the same process can and should be applied in any organization” [Kovacevic, italics are mine]. Organizations that use GAAP have reliability statisticians that use accounting data to collect periodic product installed base by age, failure counts or spare parts’ usage, and engineers have BoMs of service parts that go into product installed base by age to convert into service parts’ installed base by age. Gozinto theory or MRP (Materials Requirements Planning) says P*(I-N)-1 tells parts installed base from P = the vector of product installed base by age and N is the next-assembly matrix that tells how many of each part gozinto each next assembly [https://fred-schenkelberg-project.prev01.rmkr.net/gozinto-theory-parts-installed-base/#more-417514/].

“I have personally wasted hundreds of hours of my work life sifting through a poor CMMS structure trying to find records” [Ferrari]. Sorry, you might have looked in wrong place(s). Installed base and returns or failure counts can be derived from data required by GAAP. Revenue = Σships*price and Service Cost = Σreturns*cost per return. You may have to work to dig up the ships and returns counts, but accountants have to report revenue and cost elements. I confess that I had to look up Apple computer sales in industry publications tp convert computer sales to parts’ installed base using the gozinto formula.

Compare ISO 14224 lifetime data requirements vs. ships and returns counts. ISO 14224 requires lifetime data, which may be summarized. Table 1 (“Nevada table”) contains cohorts and grouped failure counts or equivalent for units tracked by name and serial number: start time or date, failure time or date for each product or part. Grouped lifetime data is OK, but a Nevada table is not necessary. The bottom row of table 1 contains period ships and failures or returns without cohort id. Table 2 data is statistically sufficient to make nonparametric estimates of reliability and failure rate functions.

Table 1. Nevada table of periodic ships and grouped failure or return counts. Bottom row sums monthly failure counts.

MonthShipsJanFebMarAprMayJun
Jan 35193637103
Feb 6292 38203524
Mar 7132  8132531
Apr 5633   4136
May 4222    58
Jun 4476     6
Returns 31019458878

Table 2. Statistically sufficient data from table 1. 

MonthShipsReturns
Jan 35193
Feb 629210
Mar 713219
Apr 563345
May 422288
Jun 447678
Figure 1. Nonparametric maximum likelihood estimators: Kaplan-Meier (KM) R(t) from table 1 and maximum likelihood from table 2 monthly ships and returns counts (S&R) R(t).
Figure 2. Weibull reliability least-squares estimators fit to the Kaplan-Meier (KM) and ships and returns counts (S&R) R(t).

Table 1 contains more information than table 2, because the table 1 grouped failure counts are identified by cohort. Table 2 failures could have come from any prior cohort. Kullback-Leibler divergence measures this information: Σf(t|KM)ln(f(t|KM)/f(t|S&R)) = -0.00087 bits for t=1,2,3,4,5. (f(6|S&R)=0 blows up the formula.) The Kullback-Leibler divergence for the Weibull fits is -0.004 bits. Those are small differences in information. 

The OREDA handbook 2015 edition costs NOK 4000, (~$384 USD). SuperSmith Weibull software costs 974 Euro (~$1052 USD). “Entry-level CMMS software costs $29-$79/user/month, while subscription-based CMMS onboarding spans $600-$2,500 in upfront expenses” [softwareconnect.com/learn/best-cmms-software-pricing/]. How much work would it take to get periodic ships and returns or failure counts from your company’s accounting system? Is it worth 0.00087 bits of information from CMMS or 0.004 bits if you use Weibull software?

ISO 14224 Sections vs. Corresponding GAAP data

Numbers refer to ISO sections.

1. ISO 14224 requires costs but GAAP + work supplies cost data.

2. ISO 14224 requires parts’ start times but GAAP supplies product sales or installations by period, revenue=Σprice*sales. I admit I looked up Apple computer, Firestone tire, and Ford monthly sales in industry publications. Ron Salzman, Ford, sent me actual production, when I sent him field reliability estimates.

3. ISO 14224 requires hierarchical parts’ classification but BoMs (Bills of Materials) supply parts counts and next-assembly matrix N for MRP or gozinto; P(I-N)-1 converts product installed base P into parts installed base by age [George, https://fred-schenkelberg-project.prev01.rmkr.net/gozinto-theory-parts-installed-base/#more-417514/].

4. ISO 14224 requires failure modes, which may be apparent from specifications? FMEA? FRACAS? ISO 14224 doesn’t recognize dependent failure modes or multivariate reliability.

7. ISO 14224 requires fail dates but GAAP requires periodic reporting of service costs, spare parts’ sales, complaints, etc.; J&J Glucometer complaints were nearly always led to replacements.

Published sales and failure data? Medtronic used to publish pacemaker sales, failure counts, and their reliability estimates,…until I sent them nonparametric field reliability estimates that agreed with theirs from their lifetime data.

Do the best you can with…

Nonparametric estimation of reliability from lifetime data grouped by cohort uses the Kaplan-Meier estimator. Nonparametric estimation from ships and returns uses maximum likelihood or least squares [George 2023]. Markov analysis requires actuarial (nonparametric, age-specific) transition rates. Simulation is GIGO if distributions are wrong. Multivariate reliability statistics quantifies dependent failure modes; e.g. seismic, e.g., automotive aftermarket customers who bought A also bought B.

André-Michel Ferrari asked how to estimate reliability from incomplete renewal process failure data for bearings started in 2008 from CMMS data started in 2015 (table 3). Thanks for permission to use the example. Bearings started service 1/1/2008. CMMS started 7/1/2015 collecting failures (“change”) dates. What about lost failures prior to CMMS? 

Table 3. Bearing change records TTF is since 1/7/2015. Some are successive failures of at same location. (“Date of change” is in Canadian format.)

Bearing IdTTF monthsTTF monthsDate of change
F153531/12/2019
F122232/11/2021
F271727/27/2021
F371727/27/2021
F35411/2/2021
P1606026/06/2020
P4313221/02/2018
P5393915/10/2018
P5161616/02/2020
P5262712/5/2022
P6353623/06/2018
P8878818/10/2022
P6945917/05/2023

Bearing manufacturers claim Weibull reliability [Xintao et al.], so assume a Weibull renewal process describes times between failures; TBF reliability is Exp[‑(t/η)β]. Simulate successive lifetimes from 1/1/2008 for each bearing Id and estimate Weibull parameters to match quarterly observed CMMS failure counts. Include zero-failure counts from 7/1/2015 to first recorded failure in 2018. 

Find Weibull parameters eta and beta to minimize the sum of squared errors (SSE) between observed and simulated 2015-2023 quarterly failure counts. Entries in table 4 are Excel COUNTIFIS() functions, mostly 0 and 1, that tell how many simulated Weibull renewals are in each calendar quarter. There are two “Simulated” rows in table 4, because I replicated the simulation, two times for each bearing: F1, F2, F3, P1, P4, P5, P6, and P8. I used Solver several times (because simulation changes TBFs) to find eta and beta to minimize SSE. Column B contains latest solution. Column C contains a copy of previous Solver solution. I ran Solver six times to get the mean, standard deviation and coefficient of variation for the Solver solutions. Replicating the Solver solution reduced the coefficient of variation from ~6.2% for eta and 5.3% for beta. 

Table 4. Simulated quarterly failure counts from data in table 3, start date 1/1/2008 and CMMS start date 7/1/2015.

Quarter7/1/201510/1/2015Etc.4/1/20237/1/202310/1/2023
Failures00 100
Simulated00 000
Simulated00 000

Table 5. Estimation of Weibull parameters. Column B contains current Solver solution; column C contains copy of previous solution. 

SSE3736Etc.meansdcv
Eta7962.47962.4 7924.758.180.73%
beta1.8551.855 1.800.0754.18%
mean7071.67071.6 7049.952.070.74%

The connection between Weibull parameters and SSE may not be obvious. Table 4 row labeled “Failures” counts observed quarterly renewals in table 1, including zeros. Table 2 rows labeled “simulated” count simulated quarterly renewals. SSE is SUM(observed-simulated)2 summed over all quarters since 7/1/2015. Solver adjusts eta and beta to minimize SSE. (I could have used months in table 4 instead of quarters, but I felt like aggregating into quarterly counts might help Solver.) Perhaps Bayesian methods would help supplement data in table 3 [Dykstra and Laud].

New Reliability Methods?

Estimating reliability from ships and returns counts is not new! People have been saying, “You need lifetime data to estimate reliability functions.” I estimated maximum likelihood nonparametric reliability functions for Apple Computer service parts from computer sales and parts’ returns counts (1990). I started doing that for auto parts’ renewal processes using least squares (1995).

Consider multivariate reliability function estimation in different failure modes [Maathuis, George]. E.g., COVID-19 daily case, death, and recovery counts. HIV+->AIDS->death counts? With multiple end-point or failure modes, the renewal process likelihood function is L=Πf*k(t(j)), where (f*k(.) is convolution) for k-th failure of j-th component f(t(j)) different for each (kind of) component.

Compare ISO 14224 Required Lifetime Data reliability estimates vs. ships and returns reliability estimates using the Kullback-Leibler divergence to quantify difference between alternative data. There is more information in lifetime data, even censored, than in ships and returns counts. Some of that information is wiped out by assuming constant failure rate or Weibull reliability. K-L divergence is bits of information (difference). Use cost data to convert bits to $$$ [George 2019, “Statistical Reliability Control”].

Are you paying for your CMMS to implement ISO 14224? Are you paying to put data into the CMMS? Are you paying for Weibull S/W? Are you paying auditors to certify compliance with ISO 14224? OREDA cloud data costs EURO 300/year+. 

GAAP data is free, but you have to work to dig ships and returns counts out of company financials! Do the best you can with available data. I’ll help; send data to  pstlarry@yahoo.com., describe it, and ask for nonparametric field reliability estimates. If your installed base is products, send their BoMs or gozinto matrix N along with product ships and parts’ failure counts.

ISO 14224 Contacts

ISO TC67/WG4, “Reliability Engineering and Technology”, https://www.iso.org/committee/49506.html/

ISO TC67 US representative is Tony Ciliberti, Reliability Dynamics LLC, tony.ciliberti@rd-eam.com,www.reliabilitydynamics.com/

Oil and gas industries including lower carbon energy, www.iso.org/committee/49506.html/

Tony.Krakenes@SINTEF.no, https://www.sintef.no/en/projects/2015/oreda-handbook2/

References

ISO 14224(2006)E, “Petroleum, Petrochemical and Natural Gas Industries — Collection and Exchange of Reliability and Maintenance Data for Equipment,” the International Organization for Standardization, 2006

Terje Aven, “The Flaws of the ISO 31000 Conceptualisation of Risk,” Editorial, J. Risk and Reliability, Vol. 231, Issue 5, pp. 467-468, Oct. 2017

Terje Aven, “Forget the traditional risk matrix – better alternatives exist,” Risk analysis, 2nd. Ed. Wiley. https://drive.google.com/file/d/1X0GiZsiUAsmtuHh4U6ujLftxy2QdG5uB/view/, 2015

Ali Bashir, “Implementing ISO 14224 in Six Steps,” https://www.linkedin.com/pulse/implementing-iso-14224-six-steps-ali-bashir/, May 2017

Tony Ciliberti, “Use ISO 14224 Methods to Optimize Equipment Performance Data Quality and Results from Artificial Intelligence and Machine Learning,” ANSI Blog 

R. L. Dykstra and Purushottam Laud, “A Bayesian Nonparametric Approach to Reliability, The Annals of Statistics, Vol. 9, No. 2, pp. 356-367, https://doi.org/10.1214/aos/1176345401/, 1981

André-Michel Ferrari, “10 Crucial Attributes to Optimize Your CMMS Set-up for Reliability Analysis,” 10 Crucial Attributes to Optimize your CMMS set-up for Reliability Analysis (accendoreliability.com)/, Weekly Update, Feb. 24, 2024

L. L. George, “Actuarial Forecasts for the Automotive Aftermarket,” SAE Transactions, Vol. 113, Section 5: Journal of Materials and Manufacturing, pp. 697-701, 2004

L. L. George, “Random-Tandem Queues and Reliability Estimation, Without Life Data,” RandTand.pdf – Google Drive/, 2019

L. L. George, “Gozinto Theory and Parts’ Installed Base,” Weekly Update, https://fred-schenkelberg-project.prev01.rmkr.net/gozinto-theory-parts-installed-base/#more-417514, Sept. 2023

Torkell Gjerstad, “The OREDA Handbook and Its Role In Offshore Risk Analysis,” https://oreda.com/handbook/, 2009

Stein Hauge, Solfrid Håbrekke, Mary Ann Lundteigen, and Lars Bodsberg, “Standardised Failure Reporting and Classification of Failures of Safety Instrumented Systems,” Proceedings of the 30th European Safety and Reliability Conference and the 15th Probabilistic Safety Assessment and Management Conference, 2020

James Kovacevic, “Understanding ISO 14224: Your Guide to Sustainable Defect Elimination,” https://fred-schenkelberg-project.prev01.rmkr.net/understanding-iso-14224-guide-sustainable-defect-elimination/#more-204782/, April 2017

Marloes H. Maathuis, “Nonparametric Maximum Likelihood Estimation for Bivariate Censored Data,” Thesis submitted to the Delft University of Technology for the degree of Master of Science/Wiskundig Ingenieur, January 2003

Roy Milne, “ISO 14224 – Implementation in 6 Easy Steps”, https://www.linkedin.com/pulse/iso-14224-implementation-6-easy-steps-roy-milne/, Jan. 2019

Tony Morsillo, “6 Ways CMMS software will Help you Breeze Through Audits,” https://zoidii.com/blogpost/cmms-audits/, March 2023

Wayne Nelson, “Predicting Repair Rates with Plots,” Quality Progress, pp 34-39, Nov. 2018

J. T. Selvik, E. B. Abrahamsen, and K. J. Engemann, “Definition of Reliability and Maintenance Concepts in Oil and Gas – Validity Aspects,” Safety and Reliability, Vol. 39, no. 2, pp. 134-164, DOI:10.1080/09617353.2020.1759258, 2020

Xia Xintao, Chang Zhen, Zhang Lijun, and Yang Xiaowei, “Estimation on Reliability Models of Bearing Failure Data,” Mathematical Problems in Engineering, Vol. 2018, Article ID 6189527, https://doi.org/10.1155/2018/6189527/, March 2018

Filed Under: Articles, on Tools & Techniques, Progress in Field Reliability?

About Larry George

UCLA engineer and MBA, UC Berkeley Ph.D. in Industrial Engineering and Operations Research with minor in statistics. I taught for 11+ years, worked for Lawrence Livermore Lab for 11 years, and have worked in the real world solving problems ever since for anyone who asks. Employed by or contracted to Apple Computer, Applied Materials, Abbott Diagnostics, EPRI, Triad Systems (now http://www.epicor.com), and many others. Now working on actuarial forecasting, survival analysis, transient Markov, epidemiology, and their applications: epidemics, randomized clinical trials, availability, risk-based inspection, Statistical Reliability Control, and DoE for risk equity.

« Case Study in Tolerance Design of a Spring using Monte Carlo Simulation
Maintenance and Reliability Best Practices: Certifications »

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Articles by Larry George
in the Progress in Field Reliability? article series

Join Accendo

Receive information and updates about articles and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

Recent Articles

  • Gremlins today
  • The Power of Vision in Leadership and Organizational Success
  • 3 Types of MTBF Stories
  • ALT: An in Depth Description
  • Project Email Economics

© 2025 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy