Epidemiology course, ITMO PHS
10/20/23
Real things (substances), constructs
Measurements
Gold standard
| Measured positive | Measured negative | |
|---|---|---|
| Actually positive (P) | True Positive (TP) | False Negative (TN) (Type II error) |
| Actually negative (N) | False Positive (FP) (Type I error) |
True Negative (FN) |
Sensitivity = True Positive Rate (TPR) = True positive / Actually positive
Specificity = True Negative Rate (TNR) = True Negative / Actually Negative
Positive predictive value = True Positive / Measured Positive
Negative predictive value = True Negative / Measured Negative
100% sensitivity, 100% specificity
No measurement error
Drawback:
doesn’t exist
100% sensitive test
100% specific test
50% sensitive, 50% specific
0% sensitive, 0% specific
Low sensitivity – many cases are missed, false reassurement
Low specificity – many cases are harmed by misdiagnosed and mistreatment
PPV depends on prevalence
Sensitivity 0,9, specificity 0.9
| Number | Positive tests | Negative tests | |
|---|---|---|---|
| With disease | 100 | ||
| Without disease | 10 000 | ||
| Total | 10 100 |
PPV =
Sensitivity 0,9, specificity 0.9
| Number | Positive tests | Negative tests | |
|---|---|---|---|
| With disease | 100 | 90 | 10 |
| Without disease | 10 000 | 1 000 | 9 000 |
| Total | 10 100 | 1 090 | 9 010 |
90 of 1090 are truly positive, PPV = 90 / 1090 = 0.08 (8%)
Sensitivity 0,9, specificity 0.9
| Number | Positive tests | Negative tests | |
|---|---|---|---|
| With disease | 1 000 | ||
| Without disease | 10 000 | ||
| Total | 11 000 |
PPV =
Sensitivity 0,9, specificity 0.9
| Number | Positive tests | Negative tests | |
|---|---|---|---|
| With disease | 1 000 | 900 | 100 |
| Without disease | 10 000 | 1 000 | 9 000 |
| Total | 11 000 | 1 900 | 9 100 |
900 of 1900 are truly positive, PPV = 900 / 1900 = 0.47 (47%)
Cheap, easy tests are non-specific
Specific test are costly, invasive
Sequential testing:
0) perform screening in high risk population
1) use more sensitive, less specific tests to screen
2) use more specific tests to confirm
“All screening programmes do harm; some do good as well, and, of these, some do more good than harm at reasonable cost.”
Gray, Patnick, and Blanks (2008)
\[ O(A|B) = O(A) \times \Lambda(A|B) \]
Positive likelihood ratio = TPR / FPR
Negative likelihood ratio = TNR / FNR
Posterior odds = Prior odds \(\times\) Positive LR
Never creates spurious associations (only by chance)
Calorimetric method of glucose and cholesterol measurement
Diabetes – glucose > 7 mmol/l
Hypercholesterolemia – cholesterol > 6 mmol/l
Error is introduced by non-calibrated dilution of the sample, affecting both glucose and cholesterol measurement (in the same direction)
df <- data.frame(glucose = rnorm(1000, 5.5, 1),
cholesterol = rnorm(1000, 4.5, 1),
addition_1 = rnorm(1000, 0, 1),
addition_2 = rnorm(1000, 0, 1))
# exclude negative values
df <- df[df$glucose>0 & df$cholesterol>0,]
g <- df %>% ggplot(aes(x=glucose, y=cholesterol)) +
geom_point(color="blue", alpha=0.5) +
# geom_point(aes(x=glu1, y=chol1), color="blue", alpha=0.5) +
xlim(0,10) + ylim(0,10) +
geom_hline(yintercept=6, linetype="dashed") +
geom_vline(xintercept=7, linetype="dashed") +
theme_bw()
g# add random but independent error
df$glu1 <- df$glucose + df$addition_1
df$chol1 <- df$cholesterol + df$addition_2
g1 <- df %>% ggplot(aes(x=glu1, y=chol1)) +
geom_point(color="blue", alpha=0.5) + xlim(0,10) + ylim(0,10) +
geom_hline(yintercept=6, linetype="dashed") +
geom_vline(xintercept=7, linetype="dashed") +
labs(x="glucose, misclassified", y="cholesterol, misclassified",
title="Independent misclassification") +
theme_bw()
g1# add random but independent error
df$glu2 <- df$glucose + df$addition_1
df$chol2 <- df$cholesterol + df$addition_1
g2 <- df %>% ggplot(aes(x=glu2, y=chol2)) +
geom_point(color="blue", alpha=0.5) + xlim(0,10) + ylim(0,10) +
geom_hline(yintercept=6, linetype="dashed") +
geom_vline(xintercept=7, linetype="dashed") +
labs(x="glucose, misclassified", y="cholesterol, misclassified",
title="Dependent misclassification") +
theme_bw()
g2 hypercholesterolemia
diabetes FALSE TRUE
TRUE 125 16
FALSE 731 128
Odds ratio
estimate lower upper
0.74 0.41 1.25
hypercholesterolemia
diabetes FALSE TRUE
TRUE 89 52
FALSE 764 95
Odds ratio
estimate lower upper
4.69 3.12 7.02
Deiffrential misclassification:
misclassification of exposure depends on outcome
misclassification of outcome depends on exposure
(or both)
Misclassification (Sp/Sn) is unequal accross rows or coulmns of contingency table
| Outcome | No outcome | ||
|---|---|---|---|
| Exposure | A | \(\rightarrow\) \(\leftarrow\) |
B |
| \(\downarrow \uparrow\) | \(\uparrow \downarrow\) | ||
| No exposure | C | \(\leftarrow\) \(\rightarrow\) |
D |
\[ \frac{A / B}{C / D} = \frac{A \times D}{B \times C} \]
Park et al. (2016)
Lead-time bias
Sticky diagnosis bias
Recall bias
EXCEL Trial Stone et al. (2019)
CABG vs. PCI for left main artery disease
Long rival between surgeons and interventional cardiologists
Left main artery disease was the last frontier for PCI (percutaneous intervention)
EXCEL Trial have shown non-inferiority of PCI for left main disease
Surgeon David Taggart set the EACTS meeting ablaze when he accused EXCEL researchers of stacking the deck in PCI’s favor.
Gregson et al. (2020)
With respect to all-cause mortality, 18 of the 30 excess deaths at 5 years were deemed noncardiovascular, said Stone, and there was no significant difference in the risk of cardiovascular death, which was 6.8% in the PCI arm and 5.5% in the CABG group (OR 1.26; 95% 0.85-1.85).
https://www.tctmd.com/news/former-excel-investigator-alleges-trial-manipulation-prompting-vehement-denials
Barchuk et al. (2022)
We included 1,254 cases and 2,747 controls recruited between the 6th and 14th of October in the final analysis. VE was 56% (95% CI: 48 to 63) for Gam-COVID-Vac (Sputnik V), 49% (95% CI: 29 to 63) for 1-dose Gam-COVID-Vac (Sputnik V) or Sputnik Light, -58% (95% CI: -225 to 23) for EpiVacCorona and 40% (95% CI: 3 to 63) for CoviVac. Without adjustment for the history of confirmed COVID-19 VE for all vaccines was lower, except for one-dose Gam-COVID-Vac (Sputnik Light). The adjusted VE was slightly lower in women — 51% (95% CI: 39 to 60) than men — 65% (95% CI: 5 to 73).
Control for dependent misclassification
Blinded assessment
Formal questionnaires
Validation samples
Different sources of data
Valid surrogate end-points
etc. :)
‘If you read the inscription buffalo on an elephant’s cage, do not believe your eyes.’
Kozma Prutkov
Hernan and Cole (2009)
VanderWeele and Hernan (2012)