3.1 - INTRODUCTION

Nonparametric estimation

At the beginning of any statistical study, it’s best to allow the data to speak for itself - before attempting to fit the data with a parametric distribution
Chapter 3 outlines methods to compute nonparametric estimates for

\[ \widehat{F}(t), \widehat{S}(t), se(\widehat{F}(t)) \]

The goal will be to compute \((1-\alpha)\times 100\%\) confidence intervals on point estimates for the true, but unknown, values
First, we’ll work with singly censored data before extending to multiply censored data and then to data with arbitrary censoring

Nonparametric techniques

Can be used when we don’t know the underlying distribution governing the failure process that produced the data
Are commonly used in survivability analyses (biostatistics and actuarial sciences)
Are gaining traction in reliability studies
Can simplify analyses by quickly revealing poor-fitting parametric models
Can be used as a benchmark to compare accuracy of parametric analyses
Cannot be used for prediction outside the range of observed data

3.2 - ESTIMATION FROM SINGLY CENSORED DATA

Singly censored data

Tests with only one censoring event occurring at time \(t_c\)
For a singly right censored observation, the actual failure time is unknown because the unit had not failed when the censoring event occurs
A right censored observation results when the observational period ends before the event of interest occurs for a test unit
- Product-to-market schedule limits the allocated test time for a new mobile device
- Non-safety critical aircraft components replaced at each depot cycle whether or not they have failed

par(mar = c(0,0,0,0)) ; set.seed(42)
plot(NA,
     axes = F,
     xlab = "",
     ylab = "",
     xlim = range(0,105),
     ylim = range(16,18))
segments(x0 = c(0,70),
         y0 = c(18,18),
         x1 = c(0,70),
         y1 = c(17,17),
         lwd = 2)
arrows(x0 = 0,
       y0 = 17.5,
       x1 = 100,
       y1 = 17.5,
       lwd = 2)
text(x = c(0,70,104),
     y = c(16.8,16.8,17.5),
     labels = c(0,
                expression(t[c]),
                expression(infinity)),
     cex=c(1.5,1.5,2))
text(x = c(runif(20,5,65),runif(5,75,95)),
     y = c(rep(17.5,25)),
     labels = c(rep("x",20),rep("?",5)), 
     col = c(rep(1,20),rep(2,5)), 
     cex = 2)

For a singly left censored observation the actual failure time is unknown because the unit had already failed when the censoring event occurs
Left censored observations can result when it is difficult or expensive to inspect samples, if the test was set up improperly, or if an unknown flaw causes an infant mortality failure
- Tests conducted in remote locations (polar regions, space, desert)
- Inspection requires extensive maintenance to gain access

par(mar = c(0,0,0,0)) ; set.seed(42)
plot(NA,
     axes = FALSE,
     xlab = "",
     ylab = "",
     xlim = c(0,105),
     ylim = c(16,18))
segments(x0 = c(0,70),
         y0 = c(18,18),
         x1 = c(0,70),
         y1 = c(17,17),
         lwd = 2)
arrows(x0 = 0,
       y0 = 17.5,
       x1 = 100,
       y1 = 17.5,
       lwd = 2)
text(x = c(0,70,104,runif(25,5,65)),
     y = c(16.8,16.8,17.5,rep(17.5,25)),
     labels = c(0,expression(t[c]),
                expression(infinity),
                rep("?",25)),
     cex = c(1.5,1.5,rep(2,26)),
     col = c(1,1,rep('blue',26)))

We’ll focus on tests that produce singly right censored observations as they often occur in practice

3.2 - ESTIMATION FROM SINGLY CENSORED DATA - Example

Example 3.1 & 3.2 - Plant 1 Heat Exchanger Data

Motivation
- This example introduce the concept of estimating failure probability from singly right censored data using the binomial distribution
- Recall the heatexchanger data set presented in Example 1.5
- Suppose we inspect \(100\) heat exchanger tubes at Plant 1 only, and assign each tube a value \(d_i\), where \[ d_i=\begin{cases}1&\mbox{if the tube has failed}\\\\ 0&\mbox{if the tube has not failed}\end{cases} \]
- Figure 3.1 shows the number of failures observed during each of the three years of operation

par(family = 'serif', font = 2)

plot(NA,
     axes = FALSE,
     xlab = '',
     ylab = '',
     xlim = range(-50,350),
     ylim = range(-10,150))

segments( x0 = c(0,0,0,100,200,300),
          y0 = c(50,100,50,20,20,20),
          x1 = c(350,350,0,100,200,300),
          y1 = c(50,100,100,100,100,100),
          lwd = c(rep(2,6)))

text(x = rep(-35,6),
     y = c(seq(27,127,50),seq(17,117,100),6),
     labels = c('Unconditional',
                'Plant 1',
                '100 tubes',
                'Failure',
                'at start',
                'Probability'),
     cex = c(1,1.15,1,1,1,1))

text(x = c(seq(50,250,100),325),
     y = rep(20,4),
     labels = c(expression(pi[1]),
                expression(pi[2]),
                expression(pi[3]),
                expression(pi[4])),
     cex = 1.5)

text(x = seq(50,250,100),
     y = rep(108,3),
     labels = c('Year 1','Year 2','Year 3'),
     cex = rep(1.15,3))

text(x = c(150,320,320),
     y = c(rep(140,2),130),
     labels = c('Cracked tubes','Uncracked','tubes'),
     cex = rep(1,3))

text(x = c(seq(50,250,100),325),
     y = rep(77,4),
     labels = c('1','2','2','95'),
     cex = rep(1.5,4))

Figure 3.1 - Diagram of the Plant 1 data from the heatexchanger data set

Data set subset(SMRD::heatexchanger, plant=='Plant1')
- The inspection data are \(100\) binary observations from a Bernoulli RV
- The Bernoulli parameter \(p \in [0,1]\) represents the probability of observing a “success” in this case a success is actually a failure
- The trials are mutually independent (i.e. The probability of “success” is the same for each observation )

library(SMRD)
library(DT)

DT::datatable(subset(SMRD::heatexchanger, plant == "Plant1"))

The outcome of this inspection procedue (the total number of observed “successes”) is a single observation from a binomial RV with parameters \(n, p\)
- Recall, the binomial distribution models the probability of observing \(x\) successes in \(n\) trials where the probability of success for each trial is \(p\)
- The probability mass function for a \(BIN(n,p)\) random variable is

\[ f(x;p,n) = \left( \begin{array}{c} n \\ x \end{array} \right)(p)^{x}(1 - p)^{(n-x)} \;\;\;\;\;\; \mbox{for x = 0, 1, 2,..., n} \]

where
- \(x\) is the number of observed “successes” (failures in this case)
- \(n\) is the total sample size
- \(p\) is the true (but unknown) long-run probability that a unit will fail
A nonparametric estimator for \(F(t)\) is the binomial parameter \(p\)

\[\hat{p}_{_{MLE}}=\frac{x}{n}\]

Figure 3.2 shows \(\hat{p}_{_{MLE}}\) for each of the three years of operation, along with the upper and lower \(95\%\) CI

par(family = 'serif', font = 2, cex = 1.15)

library(package = SMRD)

HE.ld <-
frame.to.ld(subset(SMRD::heatexchanger, plant == 'Plant1'),
            response.column = c(1,2),
            censor.column = 3,
            case.weight.column = 4,
            time.units = 'Years')

plot(HE.ld,
     band.type = 'Pointwise',
     ylim = c(0,.2),
     xlim = c(0,3))

Figure 3.2 - Nonparametric CDF plot of the heatexchanger data set (Plant 1 only)

3.2 - ESTIMATION FROM SINGLY CENSORED DATA

Recall, the cdf \(F(t)\) is defined everywhere in \(\mathbb{R}\), i.e. \((-\infty, \infty)\)

But, if inspections occur in discrete intervals, \(F(t)\in (t_{i-1},t_{i})\) may be unknown
- If no failures occur in \((t_{i-1},t_{i})\), \(F(t)\) is constant
- If one or more failures occur in \((t_{i-1},t_{i})\), we can estimate \(F(t_{i})-F(t_{i-1})\) But we won’t know when the failures occured in the interval
Convention for this text
- There’s insufficient information to assign a value of \(\widehat{F}(t)\in (t_{i-1},t_{i})\) for censored observations occuring between inspections
- Changes in \(\widehat{F}(t)\in (t_{i-1},t_{i})\) result at the end of the interval

3.3.2 - Confidence Intervals

Sampling errors

We usually can’t inspect every unit in a population, but we can inspect a random sample taken from the population
Regardless of how the sample was drawn, sampling errors will be present
The amount of sampling error depends on the size of the sample (relative to the size of the population)
As the sample size increases the properties of the sample approach the properties of the overall population and the sampling errors become smaller

Sources of uncertainty

There are many ways that can introduce uncertainty into our analyses
- Limited understanding of how a system interacts with its operating environment
- Imperfect knowledge of manufacturing flaws & deviations between units in a sample
- Errors introduced by fitting data to a parametric model
- Small sample sizes can amplify each of these uncertainties
Confidence intervals (CI) are useful tools for quantifying the uncertainty associated with an estimate or a prediction due to sampling errors

3.3.2 - Confidence Intervals - Example

Understanding Confidence Intervals

What does it mean to say “…the \(100\times(1-\alpha)\%\) CI for \(\theta =(\underline{\widehat{\theta}},\overline{\widehat{\theta}})\)…”
For \(\alpha=0.05,\) a \(95\%\) CI on \(\theta\) this means
- If an experiment were run \(i\) times (each run producing a distinct data set)
- The true (but unknown) value of \(\theta\) would be captured in the interval \((\underline {\widehat {\theta_{i}}},\overline{\widehat{\theta_{i}}})\) in \(95\%\) of those runs.
A \(95\%\) CI on \(\theta\) does not mean \(P(\underline{\widehat{\theta_{i}}} <\theta<\overline{\widehat{\theta_{i}}})=0.95\)
- For each experiment the probability \(P(\underline{\widehat{\theta_{i}}} <\theta<\overline{\widehat{\theta_{i}}})\) is either \(1\) or \(0\)
- This is because \(\theta\) is either in the interval, or it isn’t
The confidence_intervals app from the teachingApps package illustrates the idea of generating confidence intervals
Run the app by pasting the code below into the R console

teachingApps::teachingApp('confidence_intervals')

Exact CI vs Approximate CI

There are different ‘classes’ of quantities that we may want to estimate
- Unconstrined quantities that can be positive or negative
- Quantities that are strictly positive
- Quantities defined between 0 and 1
Specific procedures exist to construct \(100\times (1-\alpha)\%\) CI’s for quantities in each of these classes
Most of the CI estimation procedures produce approximate confidence intervals
- For confidence level \(\alpha\), \(1-\alpha\) is the nominal coverage probability
- The actual proportion of runs that capture the true value being estimated is less than a \(1-\alpha\)
- Approximate CI’s result if the procedure used to construct the CI is based on a distribution that differs from the distribution of the quantity being estimated
- Only approximate CI methods exists for data sets containing time-censored observations
For some data sets, CI estimation procedures exist to produce exact confidence intervals
- For confidence level \(\alpha\), \(1-\alpha\) is the nominal coverage probability
- The actual proportion of runs that contain the true value being estimated is equal to \(1-\alpha\)
- Exact CI’s result if the procedure used to estimate the CI is based on a same distribution as that of the value being estimated
- Exact CI’s result for complete data sets and some data sets with Type-II right-censored observations
Coverage probability is a measure used to compare the performance of different CI estimation procedures

3.4.1 - Pointwise binomial-based CI for \(F(t_i)\)

Clopper-Pearson CI For Proportions

Eq. 3.2 shows the Clopper-Pearson formulae for calculating exact binomial confidence intervals for \(F(t)\)

\[ (\underline{F}(t_i),\overline{F}(t_i))=\left(1+\frac{(n-n\widehat{F}+1)\mathcal{F}_{(1-\alpha/2,2n-2n\widehat{F}+2,2n\widehat{F})}}{n\widehat{F}}\right)^{-1}, \left(1+\frac{n-n\widehat{F}}{(n\widehat{F}+1)\mathcal{F}_{(1-\alpha/2,2n\widehat{F}+2,2n-2n\widehat{F})}}\right)^{-1} \]

Where
- \(\mathcal{F}_{c;d_{1},d_{2}}\) is the value of the F-distribution quantile function
- \(c \in [0,1]\) represents the specific quantile value
- \(d_{1}\) and \(d_{2}\) are the respective degrees of freedom
Note
- The presence of the F-distribution in (3.2) does not preclude the C-P interval from being exact
- Rather, the C-P interval is based on the relationship between the binomial and F distributions

3.4.1 - Pointwise binomial-based CI for \(F(t_i)\) - Example

Example 3.3 - Binomial CI for \(F(t_i)\)

For the Plant 1 heat exchanger data at \(t_{i}=2\) we have
- Sample size \(n=100\)
- Number of failures in \((0,2]\) is \(d=3\)
- \(\mathcal{F}_{_{(.975;200-6+2,6)}}=4.8830932\)
- \(\mathcal{F}_{_{(.975;6+2,200-6)}}=2.2578325\)
Therefore the \(95\%\) confidence interval for \(F(2)\) based on the data from this inspection is expressed as

\[ \begin{aligned} \underline{F}(2)&=\left(1+\frac{(100-100\times\frac{3}{100}+1)\mathcal{F}_{_{(.975;200-6+2,6)}}}{100\times \frac{3}{100}}\right)^{-1}=0.0062\\\\\\ \overline{F}(2)&=\left(1+\frac{100-100\times\frac{3}{100}}{(100\times\frac{3}{100}+1)\mathcal{F}_{_{(.975;6+2,200-6)}}}\right)^{-1}=0.0852 \end{aligned} \]

The following code can be used to return the results of the expression above

cp.ci<-function(n,d,a)  {
    Fhat<-d/n
    f1<-2*n-2*n*Fhat
    f2<-2*n*Fhat
    cp.lower<-(1+(n-n*Fhat+1)*qf(1-a/2,f1+2,f2)/(n*Fhat))^-1
    cp.upper<-(1+(n-n*Fhat)/((n*Fhat+1)*qf(1-a/2,f2+2,f1)))^-1
return(c(cp.lower,cp.upper))  }

cp.ci(100,3,.05)

[1] 0.006229972 0.085176053

3.4.2 - Pointwise Normal-approximate CI for \(F(t_i)\)

Confidence Intervals Based on the Normal Distribution

There are many methods for constructing confidence intervals - the approach based on the Normal approximation is the most commonly used and is used most often in this course for computing CI for \(F(t_{i})\)
Assumes the sampling errors are normally distributed WRT the binomial point estimate, i.e.

\[ Z_{\widehat{F}}=\frac{\widehat{F}(t_i)-F(t_i)}{\widehat{se}_{\widehat{F}}}\xrightarrow d N(0,1) \text{ as } n\rightarrow\infty \]

Where
- \(F(t_{i})\) is the true (but unknown) value of the CDF at time \(t_i\)
- \(\widehat{F}(t_{i})=\sum_{j=1}^{i}d_{i}/n\) is the binomial point estimate of \(F(t_{i})\)
- \(\widehat{se}_{\widehat{F}}=\sqrt{\widehat{F}(t_{i})[1-\widehat{F}(t_{i})]/n}\) the standard error of \(\widehat{F}\) based on the sample

3.4.2 - Pointwise Normal-approximate CI for \(F(t_i)\) - Example

Example 3.4 - Normal-approx CI for Plant 1 \(F(t_i)\)

Consider again the heat exchanger data for Plant 1
At the end of Year 3, five failures were observed, thus

\[ \widehat{F}(3)=\frac{5}{100}=0.05 \;\;\;\text{and}\;\;\; \widehat{se}_{\widehat{F}(3)}=\sqrt{\frac{.05(1-.05)}{100}}=.02179 \]

The \(95\%\) CI for \(F(3)\) using the normal approximation is then

\[ [\underline{F(3)}, \overline{F(3)}]=.05\pm F^{-1}_{_{NOR}}(P = 0.975)\times .02179 = [.0073,.0927] \]

Where

\(F^{-1}_{_{NOR}}(P = .975)\equiv z_{.975}\equiv\) qnorm(.975)

Table 3.1

Compares the CI’s produced by the Binomial and the Normal approximation for the Plant 1 data over years \(1, 2 \;\&\; 3\).
The CI’s produced by the binomial method are more conservative for each time interval, particularly in the upper limit.
The CI’s produced using the Normal approximation method have negative values for the lower limit in the first and second time intervals. Why?

3.4.2 - Pointwise Normal-approximate CI for \(F(t_i)\) - Example 3.5

Integrated Circuit Life Test Data

Motivation for this example
- As the number of inspections increase, the interval widths \((t_{i-1},t_{i}]\) approach zero
- In this experiment \(n=4156\) integrated circuits were tested to failure up to 1370 hours
- We dont know how the circuits were tested, just that \(d=28\) ‘exact’ failures were observed
Data set lfp1370 & Nonparametric Plots
- The data set and related figures may be seen by running the code below
- The presence of ties is a clear indication that this is grouped read-out data as defined in Chapter 1 and the failures were discovered at periodic inspections.
- The inspection interval may have been set to every .05 hours at the beginning of the test and then extended as the test progressed.

SMRD::smrd_app('lfp1370_data')

3.5 - ESTIMATION FROM MULTIPLY CENSORED DATA

Multiply censored data results when

Units are removed from the test, before observing the event of interest
Units fail due to an unexpected failure modes that are not of interest
Units are removed from the test to fill a real-world need
Units enter the test at different times
An error affects some of the units after testing begins
Example: Pooling the observations from all three plants in the heat exchanger data

Figure 3.4 - Pooled heat exchanger data

par(family = "serif", font = 2, lwd = 2, cex = 1.15)
plot(NA,
     axes = FALSE,
     xlab = "",
     ylab = "",
     xlim = c(-50,350),
     ylim = c(-10,150))
segments(x0 = c(0,0,0,100,200,300),
         y0 = c(0,100,0,0,0,0),
         x1 = c(350,350,0,100,200,300),
         y1 = c(0,100,100,100,100,100))
text(x = -35, y = 50, labels = "All Plants")
text(x = seq(75, 275, 100),
     y = rep(-8, 3),
     labels = c("4/300","5/197","2/97"))
text(x = seq(50, 250, 100),
     y = rep(108, 3),
     labels = c("Year 1","Year 2","Year 3"))
text(x = seq(140, 340, 100),
     y = rep(140,3),
     labels = c("99","95","95"))
text(x = seq(15, 215, 100),
     y = rep(90, 3),
     label = c("300","197","97"))
text(x = 75, y = 140, labels = "Uncracked tubes:")
arrows(x0 = seq(100,300,100),
       y0 = rep(100,3),
       x1 = seq(130,330,100),
       y1 = rep(125,3),
       length = rep(0.25,3))

3.5 - Estimation from multiply censored data

For singly censored data, \(F(t_{i})\) was estimated assuming that \(n\) was constant
For multiply censored data, the number of units at risk in \((t_{i-1},t_{i}]\) is expressed as

\[ n_{i}=n-\sum_{j=0}^{i-1}d_{j}-\sum_{j=0}^{i-1}r_{j}, i=1,...,m \]

where
- \(d_{j} \equiv\) the number of failed units in interval \(j, j=0,...,i-1\)
- \(r_{j} \equiv\) the number of censored units in interval \(j, j=0,...,i-1\)
- \(m \equiv\) the number of intervals (need not be of equal length)
It follows that \(\widehat{p}_{i}\) (the conditional probability that a unit will fail in interval \(i\), given that it has survived each of the previous intervals) is expressed as

\[ \widehat{p}_{i}=\frac{d_{i}}{n_{i}}, i=1,...,m \]

\[ \begin{aligned} \widehat{S}(t_{i})&=\prod_{j=1}^{i}\left[1-\widehat{p_{j}}\right], i=1,...,m\\\\ \widehat{F}(t_{i})&=1-\prod_{j=1}^{i}\left[1-\widehat{p_{j}}\right], i=1,...,m \end{aligned} \]

Fig 1.7 - Heat exchanger crack inspection data

par(family = 'serif',font = 2,mar = c(0,0,0,0))

plot(NA,
     axes = FALSE,
     xlab = '',
     ylab = '',
     xlim = range(-50,350),
     ylim = range(-10,300))

segments( x0 = c(0,0,0,0,0,rep(0,15),100,200,300),
          y0 = c(0,100,200,300,0,seq(232,280,12),seq(132,180,12),seq(32,80,12),0,0,0),
          x1 = c(350,350,350,350,0,rep(300,5),rep(200,5),rep(100,5),100,200,300),
          y1 = c(0,100,200,300,300,seq(232,280,12),seq(132,180,12),seq(32,80,12),300,300,300),
          lwd = c(rep(2,5),rep(1,18)))

text(x = rep(-25,3),
     y = seq(56,256,100),
     c('Plant 3','Plant 2','Plant 1'),
     cex=rep(0.9,3))

text(x = c(rep(50,3),rep(150,2),250),
     y = c(90,190,290,190,rep(290,2)),
     labels = c('1 failure',
                '2 failures',
                '1 failures',
                '3 failures',
                '2 failures',
                '2 failure'),
     cex = rep(0.8,6))

text(x = seq(50,250,100),
     y = rep(-8,3),
     labels = c('Year 1',
                'Year 2',
                'Year 3'),
     cex = rep(0.9,3))

text(x = c(110,210,310),
     y = seq(56,256,100),
     labels = c('99',
                '95',
                '95'),
     cex = rep(0.9,3))

segments(x0 = c(120,220,320),
         y0 = seq(56,256,100),
         x1 = c(345,345,345),
         y1 = seq(56,256,100),
         lty = rep(2,3))

arrows(x0 = c(rep(295,5),rep(195,5),rep(95,5)),
       y0 = c(seq(232,280,12),seq(132,180,12),seq(32,80,12)),
       x1 = c(rep(300,5),rep(200,5),rep(100,5)),
       length = rep(0.1,15))

arrows(x0 = rep(345,3),
       y0 = seq(56,256,100),
       x1 = rep(350,3),
       length = rep(0.1,3))

Figure 1.7 - Diagram of the transformed heatexchanger data

Table 3.2 - Pooled Heat Exchanger Data

\[ \begin{array}{lrrrrllrr} \hline Year & t_i & d_i & r_i & n_i & p_i & 1-p_i & S(t_i) & F(t_i) \\ \hline (0-1] & 1 & 4 & 99 & 300 & 4/300 & 296/300 & 0.9867 & 0.0133 \\ (1-2] & 2 & 5 & 95 & 197 & 5/197 & 192/197 & 0.9616 & 0.0384 \\ (2-3] & 3 & 2 & 95 & 97 & 2/97 & 95/97 & 0.9418 & 0.0582 \\ \hline \end{array} \]

Pooling the data assumes the usage pattern is consistent across all three plants
If this is a valid assumption, then the “test” begins with an sample set of 300 tubes

3.6 - POINTWISE CI FROM MULTIPLY CENSORED DATA

For multiply censored data, no method exists to compute exact CI’s
- We must rely on normal approximations
- In general, the normal approximation will be of the form

\[ Z_{\widehat{F}}=\frac{\widehat{F}(t_{i})-F(t_{i})}{\widehat{se}_{\widehat{F}}} \]

Computing the standard error term in the above equation will require us to use the Delta Method
By the end of this course you should know the Delta Method in your sleep

The Delta Method (B.2)

Suppose we can find the variance of \(\mathbf{\widehat{\theta}}=\widehat{\theta}_{1},...,\widehat{\theta}_{r}\)
But, we want to find the variance of some function of \(\mathbf{\theta}\), say \(g(\mathbf{\widehat{\theta}})\)
- \(g(\mathbf{\widehat{\theta}})=\log[\theta]\)
- \(g(\mathbf{\widehat{\theta}})=\theta^2\)
If we can find \(\frac{dg(\theta)}{d\theta}\), the Delta Method can help us estimate \(\widehat{Var}[g(\mathbf{\widehat{\theta}})]\) from \(\widehat{Var}[\mathbf{\widehat{\theta}}]\)

Using The Delta Method Requires 4 Things

A parameter for which we know the variance - \(\theta\)
The variance of the paramter - \(Var[\theta]\)
A function of the parameter - \(g(\theta)\)
The partial derivatives - \(\frac{\partial g(\theta)}{\partial \theta_{i}}, \;\; i=1,...m\)

The General Delta Method Equation

\[ Var\left[g(\widehat{\theta})\right]\approx \sum_{i=1}^{r} \left[\frac{\partial g(\mathbf{\theta})}{\partial \theta_{i}}\right]^{2} Var(\widehat{\theta_{i}})+\sum_{i=1}^r \mathop{\sum^{r}_{j=1}}_{i\ne j}\left[\frac{\partial g(\mathbf{\theta})}{\partial \theta_{i}}\right]\left[\frac{\partial g(\mathbf{\theta})}{\partial \theta_{j}}\right] Cov(\widehat{\theta}_{i}, \widehat{\theta}_{j}) \]

3.6.1 - Approximate Variance of \(\widehat{F}(t_{i})\) - Example

Example - Derivation of Equation 3.8

The text states that using the Delta Method with \(\theta=q_j\) and \(g(\theta)=S(t_i)\) results in

\[ \displaystyle \widehat{S}(t_{i})\approx S(t_i)+ \sum_{j=1}^{i}\frac{\partial S}{\partial q_{j}}\vert_{q_{j}}(\widehat{q}_{j}-q_{j}) \]

However, there’s not much detail showing how this result was achieved
The following example walks through the derivation of Equation 3.8

First, We Must Identify The Parameter \(\theta\)

Our goal is to find a expression for \(\widehat{Var}[\widehat{F}(t_i)]\)
- From Equation 3.6, we see that \(\widehat{S}(t_i)\) is a function of \(\widehat{p_j}\)
- Since \(\widehat{F}(t_{i})=1-\widehat{S}(t_{i})\), we know that \(Var[\widehat{F}(t_{i})]=Var[\widehat{S}(t_{i})]\)
- Similarly, \(\widehat{q_j}=1-\widehat{p_j}\) and \(Var[\widehat{q_j}]=Var[\widehat{p_j}]\)
Therefore, \(\theta = p_{j} \;\text{or}\;q_{j}\)

Next, Find The Variance of The Parameter \(Var[\theta]\)

Since \(q_{j}=1-p_{j}\), we know that \(Var[q_{j}]=Var[p_{j}]\)
Recall that \(\widehat{p}=\frac{x}{n}\) where \(x\) is the number of observed “successes”
Also, note that \(\widehat{p}\) is an unbiased estimator for \(p\)

\[E[\widehat{p}]=E\left[\frac{x}{n}\right]=\frac{E[x]}{n}=\frac{np}{n}=p\]

Since \(\widehat{p}\) is an unbiased estimator for \(p\),

\[ \begin{aligned} Var[\widehat{p}]&=Var[p]\\\\ &=Var\left[\frac{x}{n}\right]=\frac{1}{n^{2}}Var[x]=\frac{Var[x]}{n^{2}}\\\\\\ &=\frac{np(1-p)}{n^{2}}=\frac{p(1-p)}{n}\\ \end{aligned} \]

Therefore, \(Var[p]=\frac{p(1-p)}{n}=\frac{1-q(q)}{n}\)

Now, What is The Function of the Parameter\(g(\theta)?\)

From Equation 3.6

\[ \begin{aligned} S(t_{i})&=\prod_{j=1}^{i}[1-p_{j}], i=1,...,m \\ &=(1-p_{1})(1-p_{2})...(1-p_{i})\\ &=(q_{1})(q_{2})...(q_{i}) \end{aligned} \]

Therefore, \(g[p]=\prod_{j=1}^{i}[1-p_{j}]=\prod_{j=1}^{i}[q_{j}], i=1,...,m\)

Finally, What is \(\frac{\partial g(\theta_{i})}{\partial \theta_{i}}, \;\; i=1,...m\)?

In this case, it’s easier to compute the derivatives \(\frac{\partial S(t_{i})}{\partial q_{i}}, \;\; i=1,...m\)
We know that \(\forall i \in 1,2,...m\)

\[ \frac{\partial S(t_{i})}{\partial q_{i}} \;=\; \frac{\partial \left( q_{1}q_{2}...q_{i-1}q_{i}\right)}{\partial q_{i}} \;=\; q_{1}q_{2}...q_{i-1} \;=\; \frac{S(t_{i})}{q_{i}} \]

Therefore, \(\frac{\partial g(\theta_{i})}{\partial \theta_{i}}, \;\; i=1,...m=\frac{S(t_{i})}{q_{i}}\)

Putting Everything Together

\[ \begin{aligned} Var[S(t_{i})]&=\sum_{j=1}^{i}\left[\frac{\partial S(t_{i})}{\partial q_{j}}\right]^{2}Var(q_{j})\\ &=\sum_{j=1}^{i}\left[\frac{S(t_{i})}{q_{j}}\right]^{2}\frac{p_{j}(1-p_{j})}{n_{j}}\\ &=S(t_{i})^{2}\sum_{j=1}^{i}\frac{p_{j}(1-p_{j})}{n_{j}(1-p_{j})^{2}}\\ &=S(t_{i})^{2}\sum_{j=1}^{i}\frac{p_{j}}{n_{j}(1-p_{j})}\\ \end{aligned} \]

Greenwood’s formula

Substituting the estimated values into (3.8) gives what is known as Greenwood’s formula

\[ \displaystyle \widehat{Var}\left[\widehat{F}(t_{i})\right]=\widehat{Var}\left[\widehat{S}(t_i)\right]=\left[\widehat{S}(t_{i})\right]^{2}\sum_{j=1}^{i}\frac{\widehat{p}_{j}}{n_{j}(1-\widehat{p_{j}})} \]

Greenwood’s formula can then be used to estimate the standard error of \(\widehat{F}(t_{i})\) as

\[ \widehat{se}_{\widehat{F}}=\sqrt{\widehat{Var}[\widehat{F}(t_{i})]} \]

3.6.3 - Pointwise Normal-Approximate CI for \(F(t_i)\)

The pointwise \(1-\alpha\) confidence intervals for multiply censored data are then expressed as

\[ \left[\underline{F}(t_i),\overline{F}(t_i)\right]=\widehat{F}(t_i)\pm z_{(1-\alpha/2)}\widehat{se}_{\widehat{F}} \]

Under the assumption that for large sample sizes

\[ Z_{\widehat{F}}=\frac{\widehat{F}(t_i)-F(t_i)}{\widehat{se}_{\widehat{F}}}\xrightarrow d N(0,1) \text{ as } n\rightarrow\infty \]

However, for small sample sizes this assumption may fail if the distribution of \(Z_{\widehat{F}}\) is highly skewed
A better approximation for \(\widehat{F}(t_{i})\) results from a logit transformation such that

\[ Z_{\text{logit}(\widehat{F})}=\frac{\text{logit}(\widehat{F}(t_i))-\text{logit}(F(t_i))}{\widehat{se}_{\text{logit}(\widehat{F})}} \]

The transformed pointwise \(1-\alpha\) CI for multiply censored data are then

\[ \left[\underline{\underline{F}}(t_i),\overline{\overline{F}}(t_i)\right]=\left[\frac{\widehat{F}(t_i)}{\widehat{F}(t_i)+(1-\widehat{F}(t_i)\times w}, \frac{\widehat{F}(t_i)}{\widehat{F}(t_i)+(1-\widehat{F}(t_i)/ w}\right] \]

where

\[ w=\exp\left[\frac{z_{(1-\alpha/2)}\widehat{se}_{\widehat{F}}}{\widehat{F}(1-\widehat{F})}\right] \]

Example 3.7 - CI for Pooled Heat Exchanger Data

Recall Table 3.2

\[ \begin{array}{lrrrrllrr} \hline Year & t\_i & d\_i & r\_i & n\_i & p\_i & 1-p\_i & S(t\_i) & F(t\_i) \\ \hline (0-1] & 1 & 4 & 99 & 300 & 4/300 & 296/300 & 0.9867 & 0.0133 \\ (1-2] & 2 & 5 & 95 & 197 & 5/197 & 192/197 & 0.9616 & 0.0384 \\ (2-3] & 3 & 2 & 95 & 97 & 2/97 & 95/97 & 0.9418 & 0.0582 \\ \hline \end{array} \]

Therefore

\[ \begin{aligned} \widehat{Var}(\widehat{F}(1))&=[\widehat{S}(t_{i})]^{2}\sum_{j=1}^{i}\left[\frac{\widehat{p}_{j}}{n_{j}(1-p_{j})}\right]\\ &=(.9867)^{2}\left[\frac{.0133}{300(.9867)}\right]\\ &=.0000438 \end{aligned} \]

The estimated standard error is then found by \(\widehat{se}_{\widehat{F}}=\sqrt{.0000438}=.00662\)
And the \(95\%\) pointwise CI is

\[ \left[\underline{F}(1),\overline{F}(1)\right]=.0133\pm 1.960\times .00662=[.0003, .0263] \]

Applying the logit transformation to \(\widehat{F}(t_{i})\) gives

\[ \begin{aligned} \left[\underline{\underline{F}}(1),\overline{\overline{F}}(1)\right]&=\left[\frac{.0133}{.0133+(1-.0133)\times 2.6878}, \frac{.0133}{.0133+(1-.0133)/ 2.6878}\right]\\\\\\ &=[.0050, .0350]\\\\ \end{aligned} \]

where

\[ w=\exp\left[\frac{1.960(.00662)}{.0133(1-.0133)}\right]=2.86687816 \]

An interactive table of values for \(\widehat{F}(t_{i})\) can be produced using

This output may then be converted to a \(LaTeX\) table using

xtable::xtable(cdf_table, 
               include.rownames = FALSE, 
               digits = c(0,0,0,5,5,5,5))

\[ \begin{array}{rrrrrr} \hline Years-lower & Years-upper & Fhat & SE\_Fhat & 95\% Lower & 95\% Upper \\ \hline 1 & 1 & 0.01333 & 0.00662 & 0.00501 & 0.03498 \\ 2 & 2 & 0.03838 & 0.01280 & 0.01982 & 0.07302 \\ 3 & 3 & 0.05820 & 0.01870 & 0.03069 & 0.10763 \\ \hline \end{array} \]

3.7 - Estimation From Multiply Censored Data With Exact Failures

Multiply Censored Data With Exact Failures

“Exact” failures are an abstraction, but can be considered to have occurred in reality if
- Test units are under continuous inspection (some electronic systems)
- A large number of inspections are performed at closely spaced intervals
In a test with exact failures, most inspections will not discover a failure, therefore
- The value of \(F(t)\) will not change
- The plot of \(\widehat{F}(t)\) will appear as a step-function

Example 3.9 - Shock Absorber Data

Table 3.4

ShockAbsorber.ld <- frame.to.ld(shockabsorber, 
                                response.column = 1, 
                                censor.column = 3,
                                time.units = 'Kilometers')
plot(ShockAbsorber.ld, band.type = 'Pointwise')

\[ \begin{array}{rrrrrr} \hline Kilometers-lower & Kilometers-upper & Fhat & SE\_Fhat & 95\% Lower & 95\% Upper \\ \hline 0 & 6700 & 0.00000 & 0.00000 & 0.00000 & 0.00000 \\ 6700 & 9120 & 0.02632 & 0.02597 & 0.00369 & 0.16457 \\ 9120 & 12200 & 0.05495 & 0.03783 & 0.01376 & 0.19513 \\ 12200 & 13150 & 0.09130 & 0.05093 & 0.02929 & 0.25073 \\ 13150 & 14300 & 0.12916 & 0.06128 & 0.04851 & 0.30143 \\ 14300 & 17520 & 0.17271 & 0.07205 & 0.07210 & 0.35933 \\ 17520 & 20100 & 0.21625 & 0.08034 & 0.09826 & 0.41130 \\ 20100 & 20900 & 0.28156 & 0.09661 & 0.13321 & 0.49984 \\ 20900 & 22700 & 0.37137 & 0.11918 & 0.17844 & 0.61638 \\ 22700 & 26510 & 0.46117 & 0.13171 & 0.23246 & 0.70749 \\ 26510 & 27490 & 0.56894 & 0.14281 & 0.29656 & 0.80515 \\ 27490 & 28100 & 0.71262 & 0.15109 & 0.36869 & 0.91327 \\ \hline \end{array} \]

Statistical Methods for Reliability Data

Chapter 3 - Nonparametric Estimation

OVERVIEW

This chapter explains…

3.1 - INTRODUCTION

Nonparametric estimation

Nonparametric techniques

3.2 - ESTIMATION FROM SINGLY CENSORED DATA

Singly censored data

3.2 - ESTIMATION FROM SINGLY CENSORED DATA - Example

Example 3.1 & 3.2 - Plant 1 Heat Exchanger Data

3.2 - ESTIMATION FROM SINGLY CENSORED DATA

Recall, the cdf \(F(t)\) is defined everywhere in \(\mathbb{R}\), i.e. \((-\infty, \infty)\)

3.3.2 - Confidence Intervals

Sampling errors

Sources of uncertainty

3.3.2 - Confidence Intervals - Example

Understanding Confidence Intervals

Exact CI vs Approximate CI

3.4.1 - Pointwise binomial-based CI for \(F(t_i)\)

Clopper-Pearson CI For Proportions

3.4.1 - Pointwise binomial-based CI for \(F(t_i)\) - Example

Example 3.3 - Binomial CI for \(F(t_i)\)

3.4.2 - Pointwise Normal-approximate CI for \(F(t_i)\)

Confidence Intervals Based on the Normal Distribution

3.4.2 - Pointwise Normal-approximate CI for \(F(t_i)\) - Example

Example 3.4 - Normal-approx CI for Plant 1 \(F(t_i)\)

Table 3.1

3.4.2 - Pointwise Normal-approximate CI for \(F(t_i)\) - Example 3.5

Integrated Circuit Life Test Data

3.5 - ESTIMATION FROM MULTIPLY CENSORED DATA

Multiply censored data results when

Figure 3.4 - Pooled heat exchanger data

3.5 - Estimation from multiply censored data

Fig 1.7 - Heat exchanger crack inspection data

Table 3.2 - Pooled Heat Exchanger Data

3.6 - POINTWISE CI FROM MULTIPLY CENSORED DATA

The Delta Method (B.2)

Using The Delta Method Requires 4 Things

The General Delta Method Equation

3.6.1 - Approximate Variance of \(\widehat{F}(t_{i})\) - Example

Example - Derivation of Equation 3.8

First, We Must Identify The Parameter \(\theta\)

Next, Find The Variance of The Parameter \(Var[\theta]\)

Now, What is The Function of the Parameter\(g(\theta)?\)

Finally, What is \(\frac{\partial g(\theta_{i})}{\partial \theta_{i}}, \;\; i=1,...m\)?

Putting Everything Together

Greenwood’s formula

3.6.3 - Pointwise Normal-Approximate CI for \(F(t_i)\)

Example 3.7 - CI for Pooled Heat Exchanger Data

Recall Table 3.2

3.7 - Estimation From Multiply Censored Data With Exact Failures

Multiply Censored Data With Exact Failures

Example 3.9 - Shock Absorber Data

Table 3.4

3.8 - SIMULTANEOUS CONFIDENCE BANDS

Pointwise confidence intervals \((\S \;\;\text{3.6.3})\)

Simultaneous confidence intervals

Simultaneous & Pointwise CI’s

3.8.2 - Large-Sample Simultaneous CI for \(F(t)\)

Table 3.5 - Factors for Simultaneous CI

3.8.3 - Time Range for Simultaneous CI for \(F(t)\)

more to come