Example 1: A randomized clinical trial (RCT) was conducted to compare the efficacy of subcutaneous heparin versus Warfarin in preventing new episodes of deep vein thrombosis (DVT) in patients admitted to hospital for treatment of an initial episode. Patients were followed for 90 days following initial therapy. Of 101 patients assigned to Warfarin, 22 experienced new episodes compared to 10 episodes in 98 patients assigned to heparin.
DVT | noDVT | Total | |
---|---|---|---|
Warfarin | a=22 | b=79 | a+b=101 |
Heparin | c=10 | d=88 | c+d=98 |
Total | a+c=32 | b+d=167 | n=199 |
Y = number of “successes“ in sample size n independent observations Y=n∑i=1yi
ˆp=Yn estimate of true proportion, p, in population
S.E(ˆp)=√ˆp(1−ˆp)n
Based on normal approximation to binomial distribution, large sample confidence interval:
Small sample confidence interval?
Significance tests?
Let p1 and p2 represent the risk of DVT for subjects receiving warfarin and heparin respectively.
Two independent estimates ^p1 and ^p2, based on samples sizes n1 and n2, with corresponding population proportions p1 and p2 from different populations.
Estimated difference (“risk difference”) = ˆp1−ˆp2
Standard error: S.E.=(ˆp1−ˆp2)=√ˆp1(1−ˆp1)n1+ˆp2(1−ˆp2)n2
Confidence interval?
Small sample (exact) test?
Inference for paired binary data?
RR=p1p2
most relevant for cohort type studies (e.g. RCT)
RR=1 implies equal risks (i.e. p1=p2)
^RR=^p1^p2 where ^p1=aa+b and ^p2=cc+d.
SE(ln^RR)=√1a−1a+b+1c−1c+d
95% CI (large sample): ^RRe±1.96SE(ln^RR)
Test of RR=1: the usual χ2 test on 1 d.f (or Fisher’s test in small samples)
## Outcome + Outcome - Total Inc risk *
## Exposed + 22 79 101 21.8
## Exposed - 10 88 98 10.2
## Total 32 167 199 16.1
## Odds
## Exposed + 0.278
## Exposed - 0.114
## Total 0.192
##
## Point estimates and 95 % CIs:
## -------------------------------------------------------------------
## Inc risk ratio 2.13 (1.07, 4.27)
## Odds ratio 2.45 (1.09, 5.49)
## Attrib risk * 11.58 (1.54, 21.61)
## Attrib risk in population * 5.88 (-2.00, 13.75)
## Attrib fraction in exposed (%) 53.15 (6.24, 76.60)
## Attrib fraction in population (%) 36.54 (-2.52, 60.72)
## -------------------------------------------------------------------
## X2 test statistic: 4.941 p-value: 0.026
## Wald confidence limits
## * Outcomes per 100 population units
Example 2: In a landmark study published in 1950 by Richard Doll and Austin Bradford Hill in the U.K. 60 female lung cancer patients were compared with 60 similarly aged healthy patients. 41 of the cancer patients were established smokers compared to 28 in the control group.
The odds relative to a probability p = p1−p.
The odds ratio for comparing two probabilities p1 and p2 is OR=p1/(1−p1)p2/(2−p2)
Motivation?
All-purpose measure of association which applies to all designs, esp. case control, cross-sectional
OR=1 implies no association, p1=p2.
Approximation to RR under Rare Disease Assumption
Mathematically “elegant”: easy to work with
Inference for OR ^OR=^p1/(1−^p1)^p2/(2−^p2)=adbc
SE(ln^OR)=√1a+1b+1c+1d
95% CI (large sample): ^ORe±1.96SE(ln^OR)
Test of OR=1: same as for RR
## Outcome + Outcome - Total Prevalence *
## Exposed + 41 28 69 59.4
## Exposed - 19 32 51 37.3
## Total 60 60 120 50.0
## Odds
## Exposed + 1.464
## Exposed - 0.594
## Total 1.000
##
## Point estimates and 95 % CIs:
## -------------------------------------------------------------------
## Odds ratio (W) 2.47 (1.17, 5.19)
## Attrib prevalence * 22.17 (4.55, 39.78)
## Attrib prevalence in population * 12.75 (-3.26, 28.75)
## Attrib fraction (est) in exposed (%) 59.14 (9.19, 82.02)
## Attrib fraction (est) in population (%) 40.62 (7.75, 61.79)
## -------------------------------------------------------------------
## X2 test statistic: 5.763 p-value: 0.016
## Wald confidence limits
## * Outcomes per 100 population units
Can express null hypothesis of association three ways
p1−p2=0 or RR=1 or OR=1
Can test using Fisher’s exact test or Normal Theory test (Large sample)
X2=∑i=1,2∑j=1,2(Oij−Eij)2Eij
Oij = observed frequency in row i column j
Eij = expected count under hypothesis of no association
exactly equal to square of Z for testing p1=p2 with SE adapted to Null Hypothesis
Yate’s corrected chi-square = modification of above to work better for small samples
X2=∑(|O−E|−.5)2E