Recurring Data Analysis: Optimizing Reliability Assessments in Maintenance Records

Introduction

Within maintenance organizations, historical failure data extracted from Computerized Maintenance Management Systems (CMMS) serves as a foundational data source for reliability analysis. This presentation highlights the challenges encountered when employing Life Data Analysis on recurring datasets, presenting an alternative statistical model, Recurring Data Analysis, tailored specifically for these scenarios.

Life Data Analysis

Life Data Analysis LDA is used to quantify the reliability of a population of items by statistically fitting its observed time-to-failure (TTF) data to some predefined distribution models. The statistical assumptions necessitate that the items are identical, and these items operate independently. These criteria are often abbreviated as IID (Identical and Independently Distributed). These IID pre-requites imply that, when applying LDA to a population of items:

  1. Each sample has one observed value, either its age at “failure" or its current age while “non-failed”.
  2. Every time-to-failure (TTF) data point must correspond to a distinct physical sample.

Consider the following failure data collected from a maintenance record, detailing the failure instances of three pumps operating independently under similar stress conditions. Pump 1 has functioned for 3 years, while pumps 2 and 3 have each operated for 2 years.

Resources_RDA_Fig01.png
Figure 1. Failure times of individual pump since its operation

Resources_RDA_Fig02.png
Figure 2. Timeline views of each pump failures

Following describes a common mistake: Take the time-between-failures for each system and fit a distribution!

Resources_RDA_Fig03.png
Figure 3. Derive the Time-to-Failures by taking the time-between-failures for each system

Resources_RDA_Fig04.png
Figure 4. Fitting a model to the dataset (Wrong Analysis!)

Fitting a Distribution Model to a Recurring Dataset... WRONG!

What are the limitations of LDA?

The requirement for identical and independently distributed (IID) data is essential in LDA. Each Time-to-Failure (TTF) data point should directly relate to a distinct physical item. However, in this instance, the time-between-failures doesn't align with the TTF of a physical item.

If we want to apply LDA to the pumps, we're constrained to considering only the time to the first failure to uphold the IID requirement. Consequently, we're left with only three data points. Moreover, dealing with subsequent failures poses a challenge.

Most failure history records obtained from CMMS or proprietary maintenance data logging systems often provide resolution details only up to the equipment level. Therefore, they primarily offer recurring information for analysis. As this type of data doesn't meet the IID requirement, using LDA for analysis becomes inappropriate.

The followings are the likely information you would like to know from your maintenance data:

  1. Predicting the expected number of failures for a given period of time, considering the current age of the equipment.
  2. Investigating whether there's statistical evidence indicating an increase in failure intensity (or decrease in Mean Time Between Failures) over time.
  3. If the Mean Time Between Failures is decreasing, exploring the possibility of calculating the optimal overhaul time.

An alternate approach is required to model recurring data effectively, enabling us to tackle these inquiries.

Recurring Data Analysis

The "Non-Homogeneous Poisson Process with Power Law" (NHPP/Power Law) stands as the predominant reliability model for recurring data. This model encompasses the Poisson Process and Power Law equations:

Resources_RDA_Img01.png

Here, P[N(t)=n] represents the probability of observing 'n' failures by time 't', while Λ(t) denotes the cumulative number of failures (Mean Value Function). The parameters λ and β within the equations are the model parameters to be determined.


Resources_RDA_Fig05.png
Figure 5. MLE solution for λ and β by fitting the NHPP/Power Law model to a recurring dataset

For this presentation, we utilized the Recurring Data Analysis tool in AeROS software to compute the model parameters λ and β. For a comprehensive understanding of the theory behind this analysis, consider exploring academic databases like PubMed, JSTOR, Google Scholar, or the IEEE Xplore Digital Library.

Figure 6 presents the recurring data from the three pumps, inputted into the RDA tool in a worksheet format.

Resources_RDA_Fig06.png
Figure 6. Fitting NHPP/Power Law model to the Pump dataset

Utilizing the model parameters λ and β, the Mean Value Function Λ(t) is derived as illustrated:

Resources_RDA_Fig07.png
Figure 7. Cumulative Failures vs Time or Mean Value Function

By taking the derivative of Λ(t), we acquire the failure intensity, denoted as µ(t) = Λ’(t), as shown in Figure 8.

Resources_RDA_Fig08.png
Figure 8. Failure Intensity vs Time

Now, let’s try to answer the following questions:

1. What is the probability that there is no failure in 100 days?

Resources_RDA_Img02.png

2. How many times will it fail within 2 years (730 days)?

Resources_RDA_Img03.png

3. Given that system 1 is already 3 years old, what is the probability that it will not fail within the next 30 days?

Resources_RDA_Img04.png

Answers:

1. What is the probability that there is no failure in 100 days?

Resources_RDA_Img05.png

2. How many times will it fail within 2 years (730 days)?

Resources_RDA_Img06.png

3. Given that system 1 is already 3 years old, what is the probability that it will not fail within the next 30 days?

Resources_RDA_Img07.png

Optimum Overhaul (Economical Life)

From Figure 8, we can see that the failure intensity of the system increases with time. Supposing the average repair cost and overhaul cost for the pump are $10,000 and $50,000 respectively, what is the optimum time to overhaul the system?

Resources_RDA_Img08.png

Summmary

This presentation delineates the inappropriateness of Life Data Analysis (LDA) when dealing with recurring data, a common occurrence in failure data extracted from Computerized Maintenance Management Systems (CMMS) used by maintenance organizations. Instead, the recommended approach in such scenarios is Recurring Data Analysis (RDA).

RDA serves as a viable method to characterize the failure rate behavior of equipment or subsystems under the following circumstances:

  1. Data are not collected at non-repairable component level (LDA analysis is not feasible).
  2. Collecting equipment failure modes is not feasible due to the extensive effort required.
  3. You want to determine the optimum overhaul of a repairable equipment.
  4. You want to estimate reliability of a repairable equipment.
  5. You want to estimate expected number of failures a fleet of repairable system.

Related articles

Repairable Spare (Rotable Spares) Forecast

A discussion on Recurrent Data Analysis using Non-Homogeneous Poisson Process with Power Law

-End-