Accendo Reliability

Your Reliability Engineering Professional Development Site

  • Home
  • About
    • Contributors
    • About Us
    • Colophon
    • Survey
  • Reliability.fm
  • Articles
    • CRE Preparation Notes
    • NoMTBF
    • on Leadership & Career
      • Advanced Engineering Culture
      • ASQR&R
      • Engineering Leadership
      • Managing in the 2000s
      • Product Development and Process Improvement
    • on Maintenance Reliability
      • Aasan Asset Management
      • AI & Predictive Maintenance
      • Asset Management in the Mining Industry
      • CMMS and Maintenance Management
      • CMMS and Reliability
      • Conscious Asset
      • EAM & CMMS
      • Everyday RCM
      • History of Maintenance Management
      • Life Cycle Asset Management
      • Maintenance and Reliability
      • Maintenance Management
      • Plant Maintenance
      • Process Plant Reliability Engineering
      • RCM Blitz®
      • ReliabilityXperience
      • Rob’s Reliability Project
      • The Intelligent Transformer Blog
      • The People Side of Maintenance
      • The Reliability Mindset
    • on Product Reliability
      • Accelerated Reliability
      • Achieving the Benefits of Reliability
      • Apex Ridge
      • Field Reliability Data Analysis
      • Metals Engineering and Product Reliability
      • Musings on Reliability and Maintenance Topics
      • Product Validation
      • Reliability by Design
      • Reliability Competence
      • Reliability Engineering Insights
      • Reliability in Emerging Technology
      • Reliability Knowledge
    • on Risk & Safety
      • CERM® Risk Insights
      • Equipment Risk and Reliability in Downhole Applications
      • Operational Risk Process Safety
    • on Systems Thinking
      • Communicating with FINESSE
      • The RCA
    • on Tools & Techniques
      • Big Data & Analytics
      • Experimental Design for NPD
      • Innovative Thinking in Reliability and Durability
      • Inside and Beyond HALT
      • Inside FMEA
      • Institute of Quality & Reliability
      • Integral Concepts
      • Learning from Failures
      • Progress in Field Reliability?
      • R for Engineering
      • Reliability Engineering Using Python
      • Reliability Reflections
      • Statistical Methods for Failure-Time Data
      • Testing 1 2 3
      • The Manufacturing Academy
  • eBooks
  • Resources
    • Accendo Authors
    • FMEA Resources
    • Glossary
    • Feed Forward Publications
    • Openings
    • Books
    • Webinar Sources
    • Podcasts
  • Courses
    • Your Courses
    • Live Courses
      • Introduction to Reliability Engineering & Accelerated Testings Course Landing Page
      • Advanced Accelerated Testing Course Landing Page
    • Integral Concepts Courses
      • Reliability Analysis Methods Course Landing Page
      • Applied Reliability Analysis Course Landing Page
      • Statistics, Hypothesis Testing, & Regression Modeling Course Landing Page
      • Measurement System Assessment Course Landing Page
      • SPC & Process Capability Course Landing Page
      • Design of Experiments Course Landing Page
    • The Manufacturing Academy Courses
      • An Introduction to Reliability Engineering
      • Reliability Engineering Statistics
      • An Introduction to Quality Engineering
      • Quality Engineering Statistics
      • FMEA in Practice
      • Process Capability Analysis course
      • Root Cause Analysis and the 8D Corrective Action Process course
      • Return on Investment online course
    • Industrial Metallurgist Courses
    • FMEA courses Powered by The Luminous Group
    • Foundations of RCM online course
    • Reliability Engineering for Heavy Industry
    • How to be an Online Student
    • Quondam Courses
  • Calendar
    • Call for Papers Listing
    • Upcoming Webinars
    • Webinar Calendar
  • Login
    • Member Home
  • Barringer Process Reliability Introduction Course Landing Page
  • Upcoming Live Events
You are here: Home / Articles / Markov Chain Analysis

by Laxman Pangeni Leave a Comment

Markov Chain Analysis

Markov Chain Analysis

Markov Chain Analysis and Eigenvector/Eigenvalue Problem: A Powerful Tool for Reliability Engineering

In reliability engineering, predicting system behavior over time is crucial for maintenance planning and risk assessment. One powerful mathematical tool for this analysis is Markov Chain modelling. In this article, I’ll demonstrate how Markov Chains can predict device reliability using a real-world example: battery reliability in reliability testing facilities.

The Problem: UPS Battery Reliability

Consider a large scale reliability test center where battery reliability is critical to support product testing.

We need to answer: “What is the probability that a facility will have sufficient battery for the testing equipment to be available at any given time?”

Understanding Markov Chain

A Markov Chain is a mathematical model that describes a sequence of possible events where the probability of each event depends solely on the state of the previous event. In our battery example, we have three states:

  • Normal: Battery functioning normally
  • Low: Battery needs attention
  • Dead: Battery non-functional

The transition probabilities between these states are shown through state diagram and are represented in a transition matrix:

In the state diagram:

  • Each state has a self-loop showing the probability of remaining in that state
  • Arrows between states show transition probabilities
  • Each row in your matrix is represented by outgoing arrows from a state
  • The probabilities for each state sum to 1.0, as required for a Markov chain
Transition State Diagram

Key observations about this Markov chain from the example:

  1. A Normal battery state has an 80% chance of staying Normal and 20% chance of going to Low
  2. A Low battery state can recover to Normal (60%), stay Low (20%), or go Dead (20%)
  3. A Dead battery can recover to Normal (30%) or remain Dead (70%)

Transition matrix:

$$\displaystyle \left[\begin{array}{ccc} 0.8 & 0.2 & 0\\ 0.6 & 0.2 & 0.2\\ 0.3 & 0 & 0.7 \end{array}\right] $$

Transition Probability Matrix

Our interest lies in finding out –

  1. Battery state at any given time period (e.g. after 15 hours given the above transition is for each hour), and
  2. Battery state on the longer run.

Two Approaches to Solution

1. Direct Markov Chain Iteration

The system’s behavior can be calculated by repeatedly multiplying the initial state vector by the transition matrix. This shows how probabilities evolve over the time.

St+1=St*P

State Probability Vs Time

2. Eigenvalue Analysis

For long-term behavior, eigenvalue analysis provides an elegant solution. The dominant eigenvector (corresponding to eigenvalue = 1) gives us the steady-state probabilities.

Result Comparison

Matrix Iteration (t=~15):

  • Normal: ≈ 0.7059
  • Low: ≈ 0.1765
  • Dead: ≈ 0.1176

Eigenvalue Method:

  • Normal: ≈ 0.7059
  • Low: ≈ 0.1765
  • Dead: ≈ 0.1176

Both methods converge to exactly the same values at the 15th time period, which demonstrates the power and consistency of these approaches. This convergence is expected because:

  1. The dominant eigenvalue (λ₁) = 1 indicates the system has a steady state
  2. The corresponding eigenvector gives us the steady-state probabilities
  3. The Markov chain iteration naturally converges to these same values over time

Looking at the graph in image 1, we can see that by period 15, the lines have completely flattened out, indicating the system has reached stability. This visual representation confirms that both methods arrive at the same conclusion.

This equivalence is valuable because it:

  • Validates our calculations
  • Confirms we’ve reached true steady state
  • Shows that either method can be reliably used for long-term predictions

Key Insights from the Analysis

  1. Steady State Convergence – The system reaches stability around step ~15, regardless of initial conditions.
  2. Reliability Metrics – Long-term operational reliability (Normal + Low states) ≈ 88.24% // System failure probability (Dead state) ≈ 11.76%
  3. Business Implications – Maintenance scheduling can be optimized based on transition probabilities; Resource allocation can be planned using steady-state probabilities; Risk assessments can be more accurately quantified

Benefits in Reliability

  1. Predictive Power – Forecasts system behavior over time; Identifies steady-state conditions; Enables proactive maintenance planning
  2. Flexibility – Can model complex systems with multiple states; Accommodates both reversible and irreversible failures; Handles time-dependent behavior
  3. Decision Support – Quantifies reliability metrics; Supports maintenance strategy development; Aids in resource planning

Conclusion

Markov Chain/Eigen vector and value analysis provides a robust framework for reliability engineering. By combining mathematical rigor with practical application, it enables better decision-making in system design and maintenance. Whether implemented in R or Excel, these tools offer valuable insights for reliability engineers and system managers.

The example demonstrated here shows how even a simple three-state system can provide rich insights into system behavior and reliability. As systems become more complex, the power of Markov analysis becomes even more valuable.

Filed Under: Articles, on Product Reliability, Reliability by Design

About Laxman Pangeni

Laxman Pangeni is a seasoned Design and Reliability Engineer specializing in predictive modeling, data science, and advanced statistical analysis to enhance product performance and reliability across complex systems.

« 5 Decisions Reliability Analysis Can Help With
Fundamentals of Managing an Outsourced (CM) Value Stream »

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Reliability by Design series logo Photo of Laxman PangeniArticles by Laxman Pangeni
in the Reliability by Design article series

Recent Posts

  • Leadership Values in Maintenance and Operations
  • Today’s Gremlin – It’ll never work here
  • How a Mission Statement Drives Behavioral Change in Organizations
  • Gremlins today
  • The Power of Vision in Leadership and Organizational Success

Join Accendo

Receive information and updates about articles and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

© 2025 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy