Accendo Reliability

Your Reliability Engineering Professional Development Site

  • Home
  • About
    • Contributors
    • About Us
    • Colophon
    • Survey
  • Reliability.fm
  • Articles
    • CRE Preparation Notes
    • NoMTBF
    • on Leadership & Career
      • Advanced Engineering Culture
      • ASQR&R
      • Engineering Leadership
      • Managing in the 2000s
      • Product Development and Process Improvement
    • on Maintenance Reliability
      • Aasan Asset Management
      • AI & Predictive Maintenance
      • Asset Management in the Mining Industry
      • CMMS and Maintenance Management
      • CMMS and Reliability
      • Conscious Asset
      • EAM & CMMS
      • Everyday RCM
      • History of Maintenance Management
      • Life Cycle Asset Management
      • Maintenance and Reliability
      • Maintenance Management
      • Plant Maintenance
      • Process Plant Reliability Engineering
      • RCM Blitz®
      • ReliabilityXperience
      • Rob’s Reliability Project
      • The Intelligent Transformer Blog
      • The People Side of Maintenance
      • The Reliability Mindset
    • on Product Reliability
      • Accelerated Reliability
      • Achieving the Benefits of Reliability
      • Apex Ridge
      • Field Reliability Data Analysis
      • Metals Engineering and Product Reliability
      • Musings on Reliability and Maintenance Topics
      • Product Validation
      • Reliability by Design
      • Reliability Competence
      • Reliability Engineering Insights
      • Reliability in Emerging Technology
      • Reliability Knowledge
    • on Risk & Safety
      • CERM® Risk Insights
      • Equipment Risk and Reliability in Downhole Applications
      • Operational Risk Process Safety
    • on Systems Thinking
      • Communicating with FINESSE
      • The RCA
    • on Tools & Techniques
      • Big Data & Analytics
      • Experimental Design for NPD
      • Innovative Thinking in Reliability and Durability
      • Inside and Beyond HALT
      • Inside FMEA
      • Institute of Quality & Reliability
      • Integral Concepts
      • Learning from Failures
      • Progress in Field Reliability?
      • R for Engineering
      • Reliability Engineering Using Python
      • Reliability Reflections
      • Statistical Methods for Failure-Time Data
      • Testing 1 2 3
      • The Manufacturing Academy
  • eBooks
  • Resources
    • Accendo Authors
    • FMEA Resources
    • Glossary
    • Feed Forward Publications
    • Openings
    • Books
    • Webinar Sources
    • Podcasts
  • Courses
    • Your Courses
    • Live Courses
      • Introduction to Reliability Engineering & Accelerated Testings Course Landing Page
      • Advanced Accelerated Testing Course Landing Page
    • Integral Concepts Courses
      • Reliability Analysis Methods Course Landing Page
      • Applied Reliability Analysis Course Landing Page
      • Statistics, Hypothesis Testing, & Regression Modeling Course Landing Page
      • Measurement System Assessment Course Landing Page
      • SPC & Process Capability Course Landing Page
      • Design of Experiments Course Landing Page
    • The Manufacturing Academy Courses
      • An Introduction to Reliability Engineering
      • Reliability Engineering Statistics
      • An Introduction to Quality Engineering
      • Quality Engineering Statistics
      • FMEA in Practice
      • Process Capability Analysis course
      • Root Cause Analysis and the 8D Corrective Action Process course
      • Return on Investment online course
    • Industrial Metallurgist Courses
    • FMEA courses Powered by The Luminous Group
    • Foundations of RCM online course
    • Reliability Engineering for Heavy Industry
    • How to be an Online Student
    • Quondam Courses
  • Calendar
    • Call for Papers Listing
    • Upcoming Webinars
    • Webinar Calendar
  • Login
    • Member Home
  • Barringer Process Reliability Introduction Course Landing Page
  • Upcoming Live Events
You are here: Home / Articles / Semi-Nonparametric Reliability Estimation and Seasonal Forecasts

by Larry George 1 Comment

Semi-Nonparametric Reliability Estimation and Seasonal Forecasts

Semi-Nonparametric Reliability Estimation and Seasonal Forecasts

I estimated actuarial failure rates, made actuarial forecasts, and recommended stock levels for automotive aftermarket stores. I wondered how to account for seasonality in their sales? Time series forecasts account for seasonality but not for age, the force of mortality accounted for by actuarial forecasts. I finally figured out how to seasonally adjust actuarial forecasts. It’s the same method, David Cox’ “Proportional Hazards” model, used to make “Semi-Parametric” estimates and “Credible Reliability Predictions”.

Does AI require a lot of data to fit high order polynomials?

A coworker at Lawrence Livermore Lab recently wrote to me, “I found a description of how AI is being developed to make forecasts about health care, consumer buying trends, etc. It’s all based on inputting data to develop probability distribution functions. AI companies are starting to run out of data so they’re buying raw data from YouTube and other sites in order to “create” enough data for their probability distributions. Do you know anything about this? It sure sounds like that’s tailor-made for probability risk analysis and basic event probability development. Thoughts?”

I use periodic ships and returns counts, periodic case and death counts, or other beginning and end-event count data, by cohort. That data is sufficient to make nonparametric reliability function estimates, without lifetime data. Ships and returns counts are contained in or computed from data required by generally accepted accounting principles. 

Actuarial rates, a(t)=-d ln(R(t))/dt, are derived from reliability function estimates, R(t). The actuarial demand forecast for auto parts is ∑[a(s)n(t-s)], s=0,1,…,t, where n(t-s) is the number of vehicles of age t-s in neighborhoods of auto parts’ stores. Actuarial forecasts account for reliability based on ages of vehicles that use the parts [George 2004]. I recommended stores’ spare parts’ stock levels for Triad Computer Systems. If your auto parts store didn’t have what you needed in the late 1990s, it could have been my fault.

Even without accounting for seasonality, Triad’s actuarial forecasts and stock recommendations were so good that Triad was bought by a competitor who laid us off to save money. They contracted www.smartcorp.com to make time series forecasts, which may account for seasonality, but not wearout, the force of mortality or the ages of the vehicles.

Will AI Account for Seasonality?

I played with neural networks while working for Lawrence Livermore lab (LLNL). I helped a chemist doing experiments on safing the W79 nuclear artillery shell. He had run several 100s of test combinations without concluding what chemistry, quantity, and timing to use. I used an Ivakhnenko polynomial, www.gmdh.net, to model his test results and design subsequent experiments. I also used it for Marvin Windows to recommend sealant chemistry based on some tests [George 2016]. I used a FORTRAN program [Farlow] to fit an Ivakhnenko polynomial to test results, to design additional experiments and recommend safing method and sealant chemistry.

Neural network models are essentially high-order polynomials such as the Ivakhnenko polynomial. Neural network models are what AI programs use to fit data. My LLNL experience led me to found the Artificial Stupidity Research Institute (ASRI), which seeks and exposes examples of lazy approximate solutions to problems that could be solved mathematically, statistically, or deterministically based on physics, reality, and available data. ASRI also attempts to expose misguided or misleading standards that are stupid, lazy, or contain errors and faulty recommendations.

Epicor www.epicor.com is a successor to Triad Computer Systems where I used reliability statistics to estimate auto parts’ actuarial (age-specific) failure rates, etc. Since I was laid off by the automotive aftermarket company,www.predii.com got a contract from Epicor to use AI on “unstructured” automotive data to derive “service insights”, without accounting for vehicle age or mileage. Neither time-series nor AI accounts for age-specific failure rates or seasonality. 

Do the Best You Can With Available Data? – Accendo Reliability is a recent article attempting to counter the AI greed for data. People aren’t using available data required by generally accepted accounting principles to do reliability statistics. AI and its greed for data is an alternative to statistics and work! AI could account for seasonality, if you program it. Here is an easier way. 

Figure 1. Monthly frequency of four Nigerian refinery pumps, 2008-2015
Figure 1. Monthly frequency of four Nigerian refinery pumps, 2008-2015

Seasonally Adjusted Actuarial Forecasts?

It’s the same method used to make a “Credible Reliability Prediction” [George 2023]. Seasonal actuarial rate (age-specific ) estimation uses Cox’ PH model on season and month shipped in estimating actuarial failure rate functions, a(t;Z(cohort, season))=ao(t)Exp[βZ]. Vector Z={Z1,Z2} is a cohort period and forecast season and vector β contains regression coefficients.

David Cox’ proportional hazards model assumes failure rate functions for related products’ or parts’ lifetimes could be proportional with factor Exp[βZ] where Z accounts for product or part differences: β is a regression coefficient and ao(t) is an underlying failure rate function.

I used Cox’ proportional hazards model to make credible reliability predictions from previous generations of parts, their failure rate functions, and the ratios of MTBF(new)/MTBF(old). This is because generations of parts have proportional failure rate functions and the only knowledge during design of a new product the failure rate functions of older generations of similar parts and MTBF(new) based on MIL-HDBK-217 or other MTBF prediction. [George 2023]

What are Z variables and Proportionality Coefficients?

Suppose you have data in the form of a Nevada table: periodic cohort sales, ships, or installed base counts, and failure counts, grouped by cohort. (table 1) The Kaplan-Meier nonparametric reliability function is the maximum likelihood estimator using failure counts grouped by age, regardless of which cohort the failure came from [Kaplan and Meier]. The K-M estimator doesn’t account for cohort variations or seasonality of failure counts.

Table 1. Nevada table of ships and grouped failure counts by cohort: 18 months. 

CohortShipsPeriod 1Period 2Period 3Period 4Period 5Period 6Etc.
1471378135 
241 43476 
345  2496 
439   164 
543    26 
641     0 
Etc.        
Sums7111712173728 

Table 2. Kaplan-Meier reliability R(t) and failure rate function a(t) estimates, t = 1,2,…,18 

Age, tK-M R(t)K-M a(t)
10.96480.0352
20.85560.1132
30.69270.1905
40.52950.2356
50.35240.3344
60.21400.3927
Etc.  


Why not make the maximum likelihood failure rate function estimates from each cohort’s failures? Table 3 and figure 2 show variation around the K-M estimator that may be due to cohort and season. Figure 2 shows cohort failure rate functions are proportional, to each other and to the Kaplan-Meier estimator.

Table 3. Cohort failure rate function estimates derived from each cohort row of table 1.

Period/Age123456Etc.
10.0360.0620.1730.1770.3720.344Etc.
20.0730.0630.1360.2350.3080.294 
30.0400.0970.2000.2310.2920.294 
40.0260.2970.1560.2500.3330.214 
50.0270.2930.1710.2070.3480.333 
60.0220.0850.1630.2220.2860.350 
Etc.       
Figure 2. Broom chart of cohort reliability function estimates. Fat orange line is Kaplan-Meier estimate.
Figure 2. Broom chart of cohort reliability function estimates. Fat orange line is Kaplan-Meier estimate. 

Derive the proportional hazards model for cohort actuarial rate function estimates, a(t;Z(cohort), Z(season)) = ao(t)EXP[Z1*β1+Z2*β2]. Z(cohort)=Z1 is cohort age, date, or period number and Z(season)=Z2 is season index number. Z1 accounts for cohort variations that may not be due to season (table 4). Use least squares to minimize the squared differences between the observed cohort failure counts and actuarial estimates, as functions of ao(t), β1 and β2. If there is periodicity among the cohort failure rate functions β1 will be non-negligible in magnitude. If β2 is non-negligible, then you could combine cohort data within seasons for estimating cohort actuarial rates; if you find failure seasonality and have more than one year’s data, you could combine data from same seasons. 

Notice that the cohorts in the first column have Z1 values 1,2,3,1,2,3,…, because there is apparent quarterly repetition. The magnitude of β1 indicates quarterly periodicity. Annual seasonality in failures is apparent because of the magnitude of β2. According to time series FORECAST.ETS, there is no apparent seasonality in period failures, the bottom row of table 1.

Table 4. Least squares estimates of proportional hazards model: ao(t), and coefficients β1 and β2. Seasonality period is 1,2,…,12.

β10.3140 SSE->284    
β20.0234       
Cohortao(t)Period123456Etc.
10.0220.2027.3210.2980.0840.2960.441 
20.0543.3570.8000.4013.2721.8792.051 
30.0650.1463.9023.9353.5850.3581.686 
10.1500.2000.3100.3550.4880.4640.240 
20.15720.9031.1810.3250.0480.1582.174 
30.1330.0661.2091.5475.1240.5980.774 
Etc.0.2680.0611.9281.6372.0440.7700.236 
Figure 3. Actuarial failure rate estimates: npmle is from ships and returns counts, K-M is from Kaplan-Meier reliability, and ao(t)Exp(ZBeta) is adjusted for season and cohort
Figure 3. Actuarial failure rate estimates: npmle is from ships and returns counts, K-M is from Kaplan-Meier reliability, and ao(t)Exp(ZBeta) is adjusted for season and cohort

Given seasonal actuarial rate function estimates, including cohort periodicity, make period 19 actuarial forecasts, specific to a season, using the cohorts’ actuarial rate estimates, with Z2 corresponding to the season 7 =19 mod 12. 

Table 4 shows observed period failure count sums vs. Excel’s FORECAST.ETS time series actuarial Kaplan-Meier hindcasts. Excel’s FORECAST.ETS includes seasonality, if detected, (ETS stands for Exponential Time Series.) Kaplan-Meier actuarial hindcasts (forecasts of past failures) “K-M” seem OK: SSE=285. Notice the table is folded to show all 19 periods? 

Table 4. Observed grouped failure counts and hindcasts. Time series FORECAST.ETS hindcasts lag “FCST.ETS”: SSE=3334. 

Period12345678910
Observed171217372833404447
FCST.ETS8.7910.4512.1113.7715.4317.0918.7520.4122.0723.72
K-M1.66.614.021.229.733.138.841.643.043.6
Continued          
Period111213141516171819 
Observed4644413244312937???? 
FCST.ETS25.3827.0428.7030.3632.0233.6835.3437.0038.66 
K-M42.941.637.736.537.031.933.035.234.12 

Table 5 shows alternative period 19 forecasts. FCST.ETS extrapolates period returns in bottom row of table 1. Kaplan-Meier uses failure counts grouped by age from table 1 without regard to cohort. Cohort 7 actuarial rates are used to forecasts period 19 returns, because 7 = 19 mod 12. Cohorts 1, 4, 7, 10 averages cohort effects. The semi-nonparametric uses the cohort 1 and season 7 Z-values to incorporate seasonality. 

Table 5. Forecasts for period 19 without and with seasonal adjustments Z1= 19 mod 3, Z2= 19 mod 12 = 7, or both. All except FCST.ETS are actuarial forecasts.

MethodForecast
FCST.ETS38.7
Kaplan-Meier35.3
Cohort 739.31
Cohorts 1, 4, 7, 10 average35.91
Seasoned Semi-NP38.08

Conclusions? Recommendations?

Why hasn’t seasonal actuarial forecasting been done? I learned seasonal time series forecasts in school. Why don’t time series forecasts account for age-specific failure rates? Time series extrapolation is easier. Why not use the Kaplan-Meier estimator and ignore cohort variations? It’s in most statistical software along with the faulty Greenwood variance estimator [George 2024].

Want semi-nonparametric estimates and seasonal actuarial forecasts? Send grouped lifetime or failure count data in form of Nevada table or send observed or censored lifetimes in form of inputs to SAS, MiniTab, ReliaSoft, Weibull, JMP, or your favorite nonparametric estimation software (pstlarry@yahoo.com).

Did you notice that the semi-nonparametric estimation used cohort ships and failure counts? Kaplan-Meier estimator uses failure counts, grouped by age. It ignores cohort variability and gives different reliability estimates and actuarial forecasts. Why not use all information in data? 

The seasonal actuarial forecasting method is for dead-forever lifetimes or grouped failure counts, not for periodic renewal counts that do not indicate numbers of preceding failures. I have an idea of how to deal with renewals, and I am looking for data to try it.

References

David Cox, Proportional hazards model – Wikipedia

Stanley J. Farlow, “A FORTRAN Program for the GMDH Algorithm,” Chapter 15 of Self-Organizing Methods in Modeling,CRC Press 1984

Laurence L George, “Actuarial Forecasts for the Automotive Aftermarket,” SAE Transactions, Vol. 113, Section 5, J. Materials and Manufacturing,  pp. 697-701, 2004

L. L. George, “What Were You Thinking? PH-GMDH Sealant Reliability Model,” Test Engineering and Management, Vol. 78, No. 5, Oct.-Nov., 2016

L. L. George, Credible Reliability Prediction, 2nd Edition, Credible Reliability Prediction? – Accendo Reliability, 2023

L. L. George, “What if Ships Cohorts Were Random,” Weekly Update, What if Ships Cohorts Were Random? – Accendo Reliability, Jan. 2024

Kaplan, E. L.; Meier, P. “Nonparametric estimation from incomplete observations”. J. Amer. Statist. Assoc., Vol. 53(282), pp. 457–481.doi:10.2307/2281868, JSTOR 2281868, 1958

Filed Under: Articles, on Tools & Techniques, Progress in Field Reliability? Tagged With: Statistics non-parametric

About Larry George

UCLA engineer and MBA, UC Berkeley Ph.D. in Industrial Engineering and Operations Research with minor in statistics. I taught for 11+ years, worked for Lawrence Livermore Lab for 11 years, and have worked in the real world solving problems ever since for anyone who asks. Employed by or contracted to Apple Computer, Applied Materials, Abbott Diagnostics, EPRI, Triad Systems (now http://www.epicor.com), and many others. Now working on actuarial forecasting, survival analysis, transient Markov, epidemiology, and their applications: epidemics, randomized clinical trials, availability, risk-based inspection, Statistical Reliability Control, and DoE for risk equity.

« Student Questions from My Root Cause Analysis Class, Part 2
Maintenance and Reliability Best Practices: Reliability Engineers »

Comments

  1. Larry George says

    February 16, 2025 at 9:28 PM

    I hate it when I make an error like that. 3rd paragraph actuarial forecast should have been for time t is SUM[a(s)*n(t-s); s=1,2,…,t. I will try to edit it to fix it.

    Reply

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Articles by Larry George
in the Progress in Field Reliability? article series

Join Accendo

Receive information and updates about articles and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

Recent Articles

  • Gremlins today
  • The Power of Vision in Leadership and Organizational Success
  • 3 Types of MTBF Stories
  • ALT: An in Depth Description
  • Project Email Economics

© 2025 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy