Accendo Reliability

Your Reliability Engineering Professional Development Site

  • Home
  • About
    • Contributors
    • About Us
    • Colophon
    • Survey
  • Reliability.fm
  • Articles
    • CRE Preparation Notes
    • NoMTBF
    • on Leadership & Career
      • Advanced Engineering Culture
      • ASQR&R
      • Engineering Leadership
      • Managing in the 2000s
      • Product Development and Process Improvement
    • on Maintenance Reliability
      • Aasan Asset Management
      • AI & Predictive Maintenance
      • Asset Management in the Mining Industry
      • CMMS and Maintenance Management
      • CMMS and Reliability
      • Conscious Asset
      • EAM & CMMS
      • Everyday RCM
      • History of Maintenance Management
      • Life Cycle Asset Management
      • Maintenance and Reliability
      • Maintenance Management
      • Plant Maintenance
      • Process Plant Reliability Engineering
      • RCM Blitz®
      • ReliabilityXperience
      • Rob’s Reliability Project
      • The Intelligent Transformer Blog
      • The People Side of Maintenance
      • The Reliability Mindset
    • on Product Reliability
      • Accelerated Reliability
      • Achieving the Benefits of Reliability
      • Apex Ridge
      • Field Reliability Data Analysis
      • Metals Engineering and Product Reliability
      • Musings on Reliability and Maintenance Topics
      • Product Validation
      • Reliability by Design
      • Reliability Competence
      • Reliability Engineering Insights
      • Reliability in Emerging Technology
      • Reliability Knowledge
    • on Risk & Safety
      • CERM® Risk Insights
      • Equipment Risk and Reliability in Downhole Applications
      • Operational Risk Process Safety
    • on Systems Thinking
      • Communicating with FINESSE
      • The RCA
    • on Tools & Techniques
      • Big Data & Analytics
      • Experimental Design for NPD
      • Innovative Thinking in Reliability and Durability
      • Inside and Beyond HALT
      • Inside FMEA
      • Institute of Quality & Reliability
      • Integral Concepts
      • Learning from Failures
      • Progress in Field Reliability?
      • R for Engineering
      • Reliability Engineering Using Python
      • Reliability Reflections
      • Statistical Methods for Failure-Time Data
      • Testing 1 2 3
      • The Manufacturing Academy
  • eBooks
  • Resources
    • Accendo Authors
    • FMEA Resources
    • Glossary
    • Feed Forward Publications
    • Openings
    • Books
    • Webinar Sources
    • Podcasts
  • Courses
    • Your Courses
    • Live Courses
      • Introduction to Reliability Engineering & Accelerated Testings Course Landing Page
      • Advanced Accelerated Testing Course Landing Page
    • Integral Concepts Courses
      • Reliability Analysis Methods Course Landing Page
      • Applied Reliability Analysis Course Landing Page
      • Statistics, Hypothesis Testing, & Regression Modeling Course Landing Page
      • Measurement System Assessment Course Landing Page
      • SPC & Process Capability Course Landing Page
      • Design of Experiments Course Landing Page
    • The Manufacturing Academy Courses
      • An Introduction to Reliability Engineering
      • Reliability Engineering Statistics
      • An Introduction to Quality Engineering
      • Quality Engineering Statistics
      • FMEA in Practice
      • Process Capability Analysis course
      • Root Cause Analysis and the 8D Corrective Action Process course
      • Return on Investment online course
    • Industrial Metallurgist Courses
    • FMEA courses Powered by The Luminous Group
    • Foundations of RCM online course
    • Reliability Engineering for Heavy Industry
    • How to be an Online Student
    • Quondam Courses
  • Calendar
    • Call for Papers Listing
    • Upcoming Webinars
    • Webinar Calendar
  • Login
    • Member Home
  • Barringer Process Reliability Introduction Course Landing Page
  • Upcoming Live Events
You are here: Home / Articles / Convert AFRs to Field Reliability?

by Larry George Leave a Comment

Convert AFRs to Field Reliability?

Convert AFRs to Field Reliability?

AFRs are periodic ratios of failure counts divided by installed base. Have you seen meeting rooms wallpapered with AFR charts (Annualized Failure Rate)? Have you sat through debates about the wiggles in AFR charts? Fred Schenkelberg wondered if reliability could be estimated from AFRs and their input data? How about age-specific reliability and actuarial failure rate functions? Actuarial forecasts? MTBFs? Wonder no more!

AFR [Annualized failure rate – Wikipedia] is a ratio of failures divided by time or installed base, computed periodically: AFR = (failures/Operating time)*(Annualization factor). Julio Calderon found that HDD AFR and vendor 8766/MTBFs didn’t agree! (AFR=8766/MTBF; there are 8766 hours in an average year.)

AFRs and MTBFs are not reliability! Reliability, R(t), is P[Life > t] for t ≥0, and a(t) is the actuarial failure rate, [R(t‑1)‑R(t)]/R(t-1). Don’t extrapolate AFRs to make failure forecasts! That’s like driving while looking backwards. Why not use AFR or FRACAS [MIL-STD-2155] input data to estimate age-specific reliability? Why not make actuarial failure forecasts? An actuarial forecast is ∑a(s)n(t-s), s=0,1,2,…,t, where n(t‑s) is the installed base of age t-s. Actuarial forecasts account for the forces of mortality that cause failures. Look ahead!

How to Extract Installed Base Given AFRs and Failure Counts?

Ships (cohort sizes or installed base by age t), n(t), and failures or returns counts, r(t), are statistically sufficient to make nonparametric estimates of reliability and actuarial failure rate functions [George 1993]. What if you have periodic AFRs and failure counts r(t) but not installed base, n(t)? (Failures or returns counts r(t) in period t could be returns shipped in any previous or current period.) Given failures or returns counts r(t) and successive AFRs, find ships or installed base cohort sizes n(t), t=0,1,2,…. The solution for a two-period solution is 

n(1)=r1/AFR(1) and n(2)=(AFR(2)*r(1)-AFR(1)*r(2))/(AFR(1)*AFR(2)),

and the general solution is 

n(t)=(AFR(t‑1)*r(t)-AFR(t)*r(t-1))/(AFR(t-1)*AFR(t)). 

Table 1 is an example. AFR in column 5 is the ratio of Fails/Cum Ships. Cohorts “n(t)” values in column 6 are computed from the last formula. Cohorts in column 6 match simulated ships. That solution doesn’t work when either AFR() in denominator is zero or when the numerator is negative. 

Table 1. Ships are simulated Poisson(1000) monthly; “Fails” are fake. AFR = Fails/Cum Ships. Reliability is the maximum likelihood estimator.

MonthsShipsCum ShipsFailsAFRn(t)Reliability
11016101610.00098410160.9990
21007202330.00148310070.9970
31012303550.00164710120.9951
4968400370.0017499680.9928
510295032110.00218610290.9893
610046036140.00231910040.9861

Backblaze Data?

Backblaze publishes HDD (Hard-Disk Drive) and flash drive AFRs and quarterly input data: {Mfg, Model, Size, failure count, Days, Failures, AFR}. Backblaze went public in 2022 (BLZE); their www.backblaze.com site sells cloud storage. Andy Klein’s quarterly reports are under their “About” menu. He reports on millions of HDDs and summarizes, in AFRs, information about individual HDDs. IDEMA published standards for tracking individual HDD lifetimes by “vintage” including use factors that may facilitate root cause analysis [Elerath].

AFRs are not as informative as age-specific field reliability and failure rate function estimates, especially as estimates evolve during product life cycles. Backblaze quarterly report data is sufficient to estimate age-specific field reliability. I picked the Western Digital HDDs because their data started from first shipments. Their failure counts do not include failures from previous, unknown ships cohorts. [It is possible to account for failures from previous cohorts; e.g. COVID-19 mutations, Field Reliability – Corona virus survival analysis (google.com).]

Table 2. WDC Western Digital – Wikipedia reports’ HDD AFRs. The Q4 AFRs come from annual reports. 

 2020Q42021Q1Q2Q3Q42022Q1Q2Q3Q4
WDC0.16%0.57%0.46%0.39%0.32%0.00%0.16%0.30%0.40%
WUH741414ALE6L4    0.43%   0.12%
Installed6002   8408   8410
WUH741816ALE6L0    0.14%   0.12%
Installed    1767   2701
WUH741816ALE6L4        0.36%
Installed        10801

Table 3. WDC 2022 quarterly reports contain cumulative installations and failure counts too. 

Q1CountFailuresAFR
WUH741414ALE6L48408–0
WUH741816ALE6L02599–0
WUH741816ALE6L41200–0
Q2   
WUH741414ALE6L4840820.10%
WUH741816ALE6L0270220.30%
WUH741816ALE6L4119910.34%
Q3   
WUH741414ALE6L4840950.24%
WUH741816ALE6L02702–0.00%
WUH741816ALE6L4713860.71%
Q4  AFR
WUH741414ALE6L48410100.12%
WUH741816ALE6L0270130.12%
WUH741816ALE6L410801130.36%

Table 4. Quarterly ships and failure counts for input to age-specific reliability estimation. 

 WUH741414ALE6L4 WUH741816ALE6L0 WUH741816ALE6L4 
PeriodShipsFailuresShipsFailuresShipsFailures
2020Q46002     
2021Q1602     
2021 Q2602     
2021 Q3601     
2021 Q46013517671  
2022 Q1 0832012000
Q2 2103201
Q3 50051396
Q4 1003363313

How to Estimate Reliability from AFRs and Failure Counts Without Life Data?

I found cumulative installed base in quarterly reports and computed the quarterly ships n(t) so I didn’t have to infer installed base as in table 1, which made errors from the AFRs’ round-off and zeros. 

Reliability R(t) estimates (table 5) depend on whether failure counts are dead forever or renewals. Backblaze data is recorded by HDD serial number so failure counts are probably dead forever. A Google spreadsheet for dead-forever reliability estimation is available [George 2023]. My brother and I did reliability estimation for Western Digital in 1994 (column 5). Figure 1 shows that WD HDD reliability hasn’t changed much since 1994! Table 6 and figures 2-4 compare AFRs and monthly actuarial rates.

Table 5 WD HDD reliability estimates. 

Age, MonthsWUH741414ALE6L4WUH741816ALE6L0WUH741816ALE6L4WD 1994
01111
10.99983110.99893
20.9958310.999170.99485
30.991670.998990.999160.99485
40.988330.998990.997270.99485
50.973820.998990.997270.96769
60.973820.970870.993120.96769
70.958020.970870.993120.96769
80.955960.94175?0.96769
90.951830.92233?0.96769
100.225000.92233?0.96769
11?0.91262?0.96769
Figure 1. WDC HDD reliability estimates

Table 6. Compare monthly actuarial failure rates (columns 2, 4, 6, 8) and AFRs columns 3, 5, 7)

AgeWUCAFRWUCAFRWUCAFRWD
Months741414ALE6L4741414ALE6L4741816ALE6L0741816ALE6L0741816ALE6L4741816ALE6L41994
10.017%0.16%0.0%0.0%0.0%0.0%0.107%
20.400%0.57%0.0%0.0%0.083%0.23%0.409%
30.418%0.49%0.101%0.44%0.001%0.71%0.0%
40.336%0.38%0.0%0.14%0.189%0.36%0.0%
51.469%0.43%0.0%0.0%0.0%0.19%2.730%
60.0%0.0%2.814%0.15%0.416%0.38%0.0%
71.622%0.29%0.0%0.0%0.0%0.35%0.0%
80.215%0.24%3.000%0.12%  0.0%
90.433%0.12%2.062%0.30%  0.0%
10 0.48%0.0%0.0%  0.0%
11 0.77%1.053%0.15%  0.0%
Figure 2. Age-specific monthly failure rates (blue) resemble AFRs (orange)?
Figure 3. Age-specific monthly failure rates (blue) don’t resemble AFRs (orange)?
Figure 4. Age-specific monthly failure rates (blue) resemble AFRs (orange)?

Estimate MTBF from AFRs?

MTBF=8766*AFR where MTBF is in hours and 8766 is the number of hours in an average year. That MTBF is potentially biased and variable. MTBFs are predictions or estimates of the mean lives of products or parts, MTBF=∫R(t)dt where R(t) is the reliability function and the integral is from 0 to infinity. 8766*AFR-based MTBF estimates may be biased, especially in early lives, before many have failed. Computing MTBF = ∫R(t)dt ≅ ∑R(t) requires extrapolation beyond available data: linear, exponential, seasonal, or curve fitting to popular reliability functions. I extrapolated failure rate functions, a(t), and R(t) = exp[‑∑a(s)] s=1,2,…,500. Table 7 shows results of alternative extrapolation methods for a Western Digital HDD.

Table 7. Compare 12-month average of MTBF=8766*AFR(t) vs. MTBF = ∫R(t)dt extrapolated beyond 9th month of actuarial failure rate estimates.

MethodAvg(AFR*8766)LinearConstantGrowthTrendETS
MTBF, months12.8848.6116721512.8849.5

Linear increase in failure rate is a reasonable extrapolation of wearout. It yields a more reasonable HDD ~48-month MTBF than the average of the first 12 AFR*8766 values. The exponentially smoothed time series forecast (ETS) of 49.5 months agrees. Constant and exponential growth overestimate MTBF. Trend finds the linear extrapolation that fits, by least squares, the first 12 AFR*8766 values. The ~48- to 49-month MTBF seems more likely than the other methods. 

Recommendations?

Compute AFRs if you want, but estimate age-specific reliability and failure rate functions too; you have the data. Failure rate functions account for the forces of mortality that cause failures. Make actuarial forecasts, because they’re more accurate and precise than time series extrapolations. Actuarial forecasts and their distributions help plan service and inventory better than AFR time-series extrapolations. If you must supply MTBF predictions, base them on extrapolations of age-specific reliability and actuarial failure rate function estimates. 

References

Julio Calderon, “Effortless! How to Apply AFRs, MTBFs to Your Data Management Practice,” (3) Effortless! How to apply AFRs, MTBFs to your data management practice. | LinkedIn

J. G. Elerath, “AFR: problems of definition, calculation and measurement in a commercial environment,” Annual Reliability and Maintainability Symposium, 2000 Proceedings, International Symposium on Product Quality and Integrity (Cat. No.00CH37055), Los Angeles, CA, USA, pp. 71-76, doi: 10.1109/RAMS.2000.816286, 2000

L. L. George, “Estimate Reliability Functions Without Life Data”, ASQ Reliability Review, Vol. 13, No. 1, March 1993

L. L. George, Credible Reliability Prediction 2nd Edition, Credible Reliability Prediction – Field Reliability (google.com), CREDRP2020.PDF, June 2020

L. L. George, User Manual for Credible Reliability Prediction, User Manual for Credible Reliability Prediction – Field Reliability (google.com), CRPUSM1.PDF, June 2020

L. L. George, “Estimate Field Reliability Without Life Data,” Weekly Update, https://fred-schenkelberg-project.prev01.rmkr.net/estimate-field-reliability-without-life-data/#more-527694, Sept. 2023

Andy Klein, “Backblaze Drive Stats for 2022,” January 31, 2023 

DoD Handbook, Failure Reporting, Analysis and Corrective Action Taken, Mil-Std-2155(AS), Dec. 1995

Filed Under: Articles, on Tools & Techniques, Progress in Field Reliability?

About Larry George

UCLA engineer and MBA, UC Berkeley Ph.D. in Industrial Engineering and Operations Research with minor in statistics. I taught for 11+ years, worked for Lawrence Livermore Lab for 11 years, and have worked in the real world solving problems ever since for anyone who asks. Employed by or contracted to Apple Computer, Applied Materials, Abbott Diagnostics, EPRI, Triad Systems (now http://www.epicor.com), and many others. Now working on actuarial forecasting, survival analysis, transient Markov, epidemiology, and their applications: epidemics, randomized clinical trials, availability, risk-based inspection, Statistical Reliability Control, and DoE for risk equity.

« Unique Insights and Thoughts Behind the Plant Wellness Way
ISO 31000 Scope, Context, and Criteria »

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Articles by Larry George
in the Progress in Field Reliability? article series

Join Accendo

Receive information and updates about articles and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

Recent Articles

  • Gremlins today
  • The Power of Vision in Leadership and Organizational Success
  • 3 Types of MTBF Stories
  • ALT: An in Depth Description
  • Project Email Economics

© 2025 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy