Interpretation of Proportional Hazards Regression Models
Interpretation of Proportional Hazards Regression Models
Interpretation of proportional
hazards regression models
Interpretation of regression coefficients
Confidence intervals of ratio of hazards
Covariate adjusted survival functions and their
applications
• Hazard ratio
Let h(t|x1 ) and h(t|x2 ) be the hazard functions given covariate
x1 and x2 respectively. Define
h(t|x1 )
r(t|x1 , x2) = .
h(t|x2 )
For convenience, we call the ratio r(t|x1 , x2 ) as the hazard ratio
of x1 with respect to x2 .
1
• Interpretation of coefficients for nominal covariates
j = 1, . . . , k − 1.
Category x h(t|x)
1 (1, 0, . . . , 0) h0 (t) exp(β1)
2 (0, 1, . . . , 0) h0 (t) exp(β2)
··· ··· ···
k−1 (0, 0, . . . , 1) h0 (t) exp(βk−1)
k (0, 0, . . . , 0) h0 (t)
(ii) For any two categories j and l which are not baseline categories,
the hazard ratio of category j with respect to category l is given
by
h0 (t) exp(βj )
r(t|j, l) = = exp(βj − βl ).
h0 (t) exp(βl )
Hence
βj − βl = ln[r(t|j, l)].
4
Thus cβ is the log hazard ratio when the covariate value increases
by c units. In particular, β is the log hazard ratio when the co-
variate value increases by 1 unit.
5
For patients at stages I, II, III and IV, the hazard functions are,
respectively,
6
The estimated hazard ratios:
Category III w.r.t. I: e0.622 = 1.862.
Category IV w.r.t. I: e1.754 = 5.775.
Category II w.r.t. I at age 65: e−7.381+0.112×65 = 0.904.
Category II w.r.t. I at age 70: e−7.381+0.112×70 = 1.581.
where
σ̂ 2(β̂j − β̂l ) = Var(
ˆ β̂j ) + Var(
ˆ β̂l ) − 2Cov(
ˆ β̂j , β̂l ).
7
• Confidence interval for hazard ratios involving contin-
uous covariates
100(1-α)% confidence interval for quantity of the form eβj −βl +xβm :
[exp{β̂j − β̂l + xβ̂m − z1−α/2 σ̂(β̂j − β̂l + xβ̂m )},
exp{β̂j − β̂l + xβ̂m + z1−α/2 σ̂(β̂j − β̂l + xβ̂m)}],
where
⎛ ⎞
1
σ̂ 2(β̂j − β̂l ) = (1, −1, x)Σjlm ⎝ −1 ⎠ ,
x
and Σjlm is the covariance matrix of β̂j , β̂l , β̂m)
8
• Example: Clinical trial on laryngeal cancer (cont.)
The variance matrix of fitted β̂:
9
Confidence intervals for hazard ratio of Stage III rel-
ative to stage IV :
Interval for log hazard ratio:
(0.62156 − 1.7535) ± 1.96 0.1226 + 0.1797 − 2(0.0682)
x_c(1,0,0,51)
b_larynx.fit$coef
Sx_S0^(exp(t(x)%*%b))
10
To demonstrate the differences among different treatments (or
groups), it needs to compute the average covariate adjusted sur-
vival functions within each group. The average is given by
1
[Ŝ0 (t)]exp(xi β̂ ),
t
Sj =
Nj
xi∈Group j
where Nj is the number of individuals in group j.
The fitted linear predictor {xti β̂, i = 1, . . . , n} for all individuals
can be extracted from a coxph object as in the example below:
xb_larynx.fit$linear
S1_S0^( mean(exp(xb[group==1])) )
h0_coxph.detail(larynx.fit)$hazard
H0_0
S0_NULL
for (i in 1:length(h0)) {
H0_H0+h0[i]
S0[i]_exp(-H0) }
11
xb_larynx.fit$linear
group_larynx.ext$V1
S1_S0^( mean(exp(xb[group==1])) )
S2_S0^( mean(exp(xb[group==2])) )
S3_S0^( mean(exp(xb[group==3])) )
event_coxph.detail(larynx.fit)$time
matplot(event,cbind(S1,S2,S3,S4),type="l")
12