ROC class notes

Le Kang

2024-05-01

Time-dependent ROC curve

What do we do when outcome is right censored survival time?

For a given threshold \(c\), the time-dependent sensitivity (Se) and specificity (Sp) can defined respectively by

\(Se(c,t)=P(S_i>c|D_i(t)=1)\)
\(Sp(c,t)=P(S_i\leq c|D_i(t)=0)\)

At any time \(t\), \[AUC(t)=\int_{-\infty}^{\infty}Se(c,t)d[1-Sp(c,t)]\]

Cumulative sensitivity and dynamic specificity (C/D)

At each time point \(t\), each individual is classified as a case or control. A case is defined as an individual with an event up to time \(t\), while a control is an individual remaining event-free at time \(t\).

\(Se^{C}(c,t)=P(S_i>c|T_i\leq t)\)
\(Sp^{D}(c,t)=P(S_i\leq c|T_i>t)\)

\[AUC^{C/D}(t)=P(S_i>S_j|T_i\leq t,T_j>t), i\neq j.\]

Naive estimators

\(\widehat{Se}^{C}(c,t)=\dfrac{\sum_i\delta_i I(S_i>c,T_i\leq t)}{\sum_i\delta_i I(T_i\leq t)}\)
\(\widehat{Sp}^{D}(c,t)=\dfrac{\sum_i I(S_i\leq c,T_i> t)}{\sum_i I(T_i> t)}\)

Why it is biased?

Kaplan-Meier estimator of Heagerty et al.

\(\widehat{Se}^{C}(c,t)=\hat{P}(S_i>c|T_i\leq t)=\dfrac{\left\{1-\hat{S}(t|S_i>c)\right\}\left(1-\hat{F}_S(c)\right)}{1-\hat{S}(t)}\)
\(\widehat{Sp}^{D}(c,t)=\hat{P}(S_i\leq c|T_i>t)=\dfrac{\hat{S}(t|S_i\leq c)\hat{F}_S(c)}{\hat{S}(t)}\)

\(\hat{S}(t)\) is the estimated survival function, \(\hat{S}(t|S_i>c)\) is the estimated conditional survival function for the subset defined by \(S > c\).

It is more appropriate to apply the C/D definitions when there is a specific time of interest that is used to discriminate between individuals experiencing the event and those event-free prior to the specific time.

However, since some individuals may contribute as controls at an earlier time and then contribute as cases later, this definition uses redundant information in separating cases and controls.

Incident sensitivity and dynamic specificity (I/D)

A case is defined as an individual with an event right at time \(t\), while the control is an event-free individual at time \(t\).

\(Se^{I}(c,t)=P(S_i>c|T_i=t)\)
\(Sp^{D}(c,t)=P(S_i\leq c|T_i>t)\)

\[AUC^{I/D}(t)=P(S_i>S_j|T_i= t,T_j>t), i\neq j.\]

In this definition, there are individuals neither a control nor case (when the event time is less than the target time, i.e. \(T_i < t\)).

Each individual who had an event may play the role of control at the earlier time (when the event time is greater than target time, i.e., \(T_i > t\)) but then contributes as a case at the later incident time (when the event time is the same as the target time, i.e., \(T_i = t\)).

The I/D is more appropriate when the exact event time is known and we want to discriminate between individuals experiencing the event and those event-free at a given event-time, i.e. \(T_i = t\).

This is essentially dichotomizing the risk set at time \(t\) into cases and controls, a natural companion to hazard models.

This also allows time-averaged concordance measure \(C\)-index. This is a special advantage of the I/D definition, since in many applications no a prior time \(t\) is identified, thus a global accuracy summary is usually desired.

Outcomes with no censoring

First, assume that the survival time \(X\) is actually observed without any censoring, i.e., \(\delta_i=1\) (\(i=1, 2, \ldots, n\)). Upon randomly drawing a pair of subjects, say \((i, j)\), \(i\neq j\), we may have five types of pairs between the survival time \(X\) and the predictive score \(Y\).

a concordance with probability \(\Pi_c=P(X_i<X_j ~\text{and}~ Y_i<Y_j ~\text{or}~ X_i>X_j ~\text{and}~ Y_i>Y_j)\);
a discordance with probability \(\Pi_d=P(X_i<X_j ~\text{and}~ Y_i>Y_j ~\text{or}~ X_i>X_j ~\text{and}~ Y_i<Y_j)\);
an \(X\)-only tie with probability \(\Pi_{tX}=P(X_i=X_j ~\text{and}~ Y_i>Y_j ~\text{or}~ X_i=X_j ~\text{and}~ Y_i<Y_j)\);
a \(Y\)-only tie with probability \(\Pi_{tY}=P(X_i<X_j ~\text{and}~ Y_i=Y_j ~\text{or}~ X_i>X_j ~\text{and}~ Y_i=Y_j)\);
a joint tie in both \(X\) and \(Y\) with probability \(\Pi_{tXY}=P(X_i=X_j ~\text{and}~ Y_i=Y_j )\).

These five possibilities for a random pair are comprehensive and mutually exclusive, and therefore \[\Pi_c+\Pi_d+\Pi_{tX}+\Pi_{tY}+\Pi_{tXY}=1.\]

Kim’s measure \(d_{X\cdot Y}\), \[d_{X\cdot Y}=\dfrac{\Pi_c-\Pi_d}{\Pi_c+\Pi_d+\Pi_{tY}}=\dfrac{\Pi_c-\Pi_d}{1-\Pi_{tX}-\Pi_{tXY}}\] is the probability of a concordance minus the probability of a discordance, both conditioned on the occurrence of distinct values of outcome \(X\), for quantifying the degree of relationship between \(X\) and \(Y\).

Outcomes with right-censoring

Define \(sign\) and \(csign\) (\(sign\) with censoring) functions as below, \[ sign\left( Y_i, Y_j\right) = I(Y_i > Y_j) - I(Y_i < Y_j)\] \[ csign\left(X_i, \delta_i, X_j, \delta_j\right) = I(X_i \geq X_j)\delta_j - I(X_i \leq X_j)\delta_i \]

The order of two survival times \(X_i\) and \(X_j\) can be unambiguously determined if and only if \(csign\left(X_i, \delta_i, X_j, \delta_j\right)\neq 0\).

a generalized concordance with probability \(\Pi_c^g=P(csign\left(X_i, \delta_i, X_j, \delta_j\right) sign\left(Y_i,Y_j\right)=1)\);
a generalized discordance with probability \(\Pi_d^g=P(csign\left(X_i, \delta_i, X_j, \delta_j\right) sign\left(Y_i,Y_j\right)=-1)\);
a generalized \(X\)-only tie with probability \(\Pi_{tX}^g=P(csign\left(X_i, \delta_i, X_j, \delta_j\right)=0,sign\left(Y_i,Y_j\right)\neq 0)\);
a generalized \(Y\)-only tie with probability \(\Pi_{tY}^g=P(csign\left(X_i, \delta_i, X_j, \delta_j\right)\neq 0,sign\left(Y_i,Y_j\right)= 0)\);
a generalized joint tie in both \(X\) and \(Y\) with probability \(\Pi_{tXY}^g=P(csign\left(X_i,\delta_i, X_j, \delta_j\right)=0,sign\left(Y_i,Y_j\right)= 0)\)

Overall survival C-index \(C_{XY}^g\)

\[ P(csign\left(X_i, \delta_i, X_j, \delta_j\right)sign\left( Y_i, Y_j\right)=1|csign\left(X_i,\delta_i, X_j, \delta_j\right)\neq0)+\\\frac{1}{2}P(sign(Y_i,Y_j)=0|csign\left(X_i,\delta_i, X_j, \delta_j\right)\neq 0)\]

\[C_{XY}^g = \dfrac{\Pi_c^g+\frac{1}{2}\Pi_{tY}^g}{\Pi_c^g+\Pi_d^g+\Pi_{tY}^g}=\dfrac{\Pi_c^g+\frac{1}{2}\Pi_{tY}^g}{1-\Pi_{tX}^g-\Pi_{tXY}^g}.\]

How to estimate \(C_{XY}^g\)?

\[C_{XY}^g = \frac{1}{2}\left(\dfrac{\Pi_c^g-\Pi_d^g}{\Pi_c^g+\Pi_d^g+\Pi_{tY}^g}+1\right)\]

Notice that,

\[\dfrac{\Pi_c^g-\Pi_d^g}{\Pi_c^g+\Pi_d^g+\Pi_{tY}^g}=\dfrac{\Pi_c^g-\Pi_d^g}{1-\Pi_{tX}^g-\Pi_{tXY}^g}\\=\dfrac{E[csign\left(X_i, \delta_i, X_j, \delta_j\right)sign\left( Y_i, Y_j\right)]}{E[csign\left(X_i, \delta_i, X_j, \delta_j\right)^2]}\]

Consider these two quantities,

\[\dfrac{1}{n(n-1)}\sum_i\sum_{j\neq i}csign\left(X_i, \delta_i, X_j, \delta_j\right)sign\left( Y_i, Y_j\right)\] \[\dfrac{1}{n(n-1)}\sum_i\sum_{j\neq i}csign\left(X_i, \delta_i, X_j, \delta_j\right)^2\]