Building a Cox Proportional Hazards Model Coursework

Exclusively available on Available only on IvyPanda® • No AI

Table of Contents

Identifying the Variables
Testing the Assumption of Proportionality Using Kaplan-Meier Method
The Cox Proportional Hazards Test
Effects of the Presence of Ties
References
Appendix 1

Identifying the Variables

In the given dataset, the censoring variable will be stroke, because the cases will be censored according to it; the cases will be right-censored if stroke=0, i.e. for those patients who did not have a stroke while having been observed.

The time-to-event variable will be followed; it shows how long the patients had been observed before they suffered from a stroke or withdrew from the study (were censored).

Tied events are events which occurred at the same time (Borucka, 2014). As can be seen from the frequency table for the variable followed in Appendix 1, there were no tied events in the data; all the 31 events where followed=32.02 are censored.

For a research question “Is there an association between hypertension and the time a person was followed before experiencing a stroke?,” hypertension is the independent variable.

#	Variable	Role in the analysis
1	stroke	Censoring and outcome (dependent) variable (event variable)
2	followed	Time-to-event variable
3	hypertension	Independent (covariate), exposure variable

Testing the Assumption of Proportionality Using Kaplan-Meier Method

The hazard plot for the given dataset using time=followed, status=stroke, and factor=hypertension, is as follows:

Testing the Assumption of Proportionality Using Kaplan-Meier Method

The plot of hazard functions can be utilized in order to evaluate whether the assumption of proportionality of hazards is violated (Forthofer, Lee, & Hernandez, 2007).

It appears that the baseline hazards (the blue line, the one which depicts hazards for the group with no hypertension) are proportional to the hazards in the tested group (the green line, which represents hazards for the groups with hypertension) at least at some intervals:

At the interval of time from 0 to approximately 10 months, the hazards do not appear proportional, for the baseline hazards remain constant and very close to 0, while the hazards in the tested group increase;
During the interval of time from nearly 10 to roughly 22 months, the hazards are proportional (apparently), for both lines grow constantly, even though the hazards in the tested group grow faster;
During the interval of time from approximately 22 to 32.02 months, the hazards also seem to be proportional, although the proportion appears to differ from that which held in the previous interval, for the hazards in the tested group go up considerably faster.

If the hazards are proportional in both the test/experimental group and the baseline group, it means that the assumption of proportional hazards is satisfied, and the Cox proportional hazards model can be built; there is no need to look for alternative tests or to try to adjust the test (Forthofer et al., 2007).

The Cox Proportional Hazards Test

Reporting and Interpreting the Test

The results of the test (time = followed, status = stroke, covariates = hypertension(Cat)) are as follows:

Categorical Variable Codings^a
		Frequency	(1)^c
Had hypertension^b	0	125	1
	1	55	0
a. Category variable: Had hypertension (hypertension)
b. Indicator Parameter Coding
c. The (0,1) variable has been recoded, so its coefficients will not be the same as for indicator (0,1) coding.

Omnibus Tests of Model Coefficients

-2 Log Likelihood

609.936

Case Processing Summary
		N	Percent
Cases available in analysis	Event^a	65	36.1%
	Censored	115	63.9%
	Total	180	100.0%
Cases dropped	Cases with missing values	0	0.0%
	Cases with negative time	0	0.0%
	Censored cases before the earliest event in a stratum	0	0.0%
	Total	0	0.0%
Total		180	100.0%
a. Dependent Variable: Time followed in Month

Omnibus Tests of Model Coefficients^a
-2 Log Likelihood	Overall (score)			Change From Previous Step			Change From Previous Block
-2 Log Likelihood	Chi-square	df	Sig.	Chi-square	df	Sig.	Chi-square	df	Sig.
592.012	20.726	1	.000	17.924	1	.000	17.924	1	.000
a. Beginning Block Number 1. Method = Enter

Variables in the Equation
	B	SE	Wald	df	Sig.	Exp(B)	95.0% CI for Exp(B)
							Lower	Upper
Had hypertension	-1.081	.249	18.852	1	.000	.339	.208	.553

Therefore, the variable hypertension significantly predicted the difference in survival (that is, in the occurrence of a stroke): B(1)=-1.081, p<.001. Therefore, the hazard ratio based on the variable hypertension was: Exp(B)=.339 (95% confidence interval:.208-.553), which means that for a patient without hypertension, the stroke hazard is approximately.339 times of that of a patient with hypertension.

The Cox Proportional Hazards Test

The hazard plot shows the change in cumulative hazard of stroke for groups with and without hypertension with the passage of time. The horizontal axis shows time, whereas the vertical axis shows the cumulative hazard, which equals to the negative log of probability of survival (Forthofer et al., 2007). Clearly, the hazard grows considerably faster in the group with hypertension.

Effects of the Presence of Ties

It is stressed that while calculating the partial likelihood function, the order in which events occur plays an important role, because each time an event takes place, certain expressions describing the states of all the subjects that are at risk at the time of that event are added; therefore, if two events are tied, it is not clear which of the participants should be considered having an event, and which ought to be counted as being at risk (Borucka, 2014, p. 95). This hinders the calculation of the partial likelihood function (Borucka, 2014).

To address the problem of ties, Borucka (2014) offers five different methods, the simplest of which is subtracting a very small random number from the time of tied events (it is stated to be quite effective), two more complicated ones are Efron and Breslow approximations (Efron approximation is claimed to be better), whereas the discrete model and the exact expression are the methods which are argued to result in the best model fit but are cumbersome.

References

Borucka, J. (2014). Methods for handling tied events in the Cox proportional hazard model. Studia Oeconomica Posnaniensia, 2(2), 91-106. Web.

Forthofer, R. N., Lee, E. S., & Hernandez, M. (2007). Biostatistics: A guide to design, analysis, and discovery (2nd ed.). Burlington, MA: Elsevier Academic Press.

Appendix 1

Frequencies table for the variable followed:

Time followed in Month
		Frequency	Percent	Valid Percent	Cumulative Percent
Valid	.82	1	.6	.6	.6
	.90	1	.6	.6	1.1
	1.67	1	.6	.6	1.7
	2.48	1	.6	.6	2.2
	3.43	1	.6	.6	2.8
	3.81	1	.6	.6	3.3
	3.83	1	.6	.6	3.9
	3.85	1	.6	.6	4.4
	4.04	1	.6	.6	5.0
	4.63	1	.6	.6	5.6
	4.78	1	.6	.6	6.1
	4.86	1	.6	.6	6.7
	5.26	1	.6	.6	7.2
	5.51	1	.6	.6	7.8
	5.55	1	.6	.6	8.3
	5.88	1	.6	.6	8.9
	5.99	1	.6	.6	9.4
	6.63	1	.6	.6	10.0
	6.91	1	.6	.6	10.6
	6.93	1	.6	.6	11.1
	6.99	1	.6	.6	11.7
	8.04	1	.6	.6	12.2
	9.36	1	.6	.6	12.8
	9.57	1	.6	.6	13.3
	9.63	1	.6	.6	13.9
	9.82	1	.6	.6	14.4
	9.88	1	.6	.6	15.0
	10.54	1	.6	.6	15.6
	10.64	1	.6	.6	16.1
	11.01	1	.6	.6	16.7
	11.62	1	.6	.6	17.2
	11.71	1	.6	.6	17.8
	11.78	1	.6	.6	18.3
	12.08	1	.6	.6	18.9
	12.14	1	.6	.6	19.4
	12.23	1	.6	.6	20.0
	12.85	1	.6	.6	20.6
	13.67	1	.6	.6	21.1
	13.78	1	.6	.6	21.7
	13.79	1	.6	.6	22.2
	13.80	1	.6	.6	22.8
	14.01	1	.6	.6	23.3
	14.09	1	.6	.6	23.9
	14.15	1	.6	.6	24.4
	15.10	1	.6	.6	25.0
	15.63	1	.6	.6	25.6
	15.91	1	.6	.6	26.1
	16.28	1	.6	.6	26.7
	16.29	1	.6	.6	27.2
	17.17	1	.6	.6	27.8
	17.30	1	.6	.6	28.3
	17.45	1	.6	.6	28.9
	17.52	1	.6	.6	29.4
	17.59	1	.6	.6	30.0
	17.89	1	.6	.6	30.6
	17.95	1	.6	.6	31.1
	17.96	1	.6	.6	31.7
	18.02	1	.6	.6	32.2
	18.31	1	.6	.6	32.8
	18.43	1	.6	.6	33.3
	18.91	1	.6	.6	33.9
	18.93	1	.6	.6	34.4
	19.10	1	.6	.6	35.0
	19.45	1	.6	.6	35.6
19.58	1	.6	.6	36.1
19.68	1	.6	.6	36.7
20.18	1	.6	.6	37.2
20.27	1	.6	.6	37.8
21.10	1	.6	.6	38.3
21.38	1	.6	.6	38.9
21.56	1	.6	.6	39.4
21.61	1	.6	.6	40.0
21.66	1	.6	.6	40.6
21.66	1	.6	.6	41.1
21.71	1	.6	.6	41.7
21.79	1	.6	.6	42.2
22.07	1	.6	.6	42.8
22.08	1	.6	.6	43.3
22.12	1	.6	.6	43.9
22.21	1	.6	.6	44.4
22.38	1	.6	.6	45.0
22.45	1	.6	.6	45.6
22.58	1	.6	.6	46.1
22.99	1	.6	.6	46.7
23.01	1	.6	.6	47.2
23.17	1	.6	.6	47.8
23.27	1	.6	.6	48.3
23.32	1	.6	.6	48.9
23.40	1	.6	.6	49.4
23.57	1	.6	.6	50.0
24.06	1	.6	.6	50.6
24.20	1	.6	.6	51.1
24.33	1	.6	.6	51.7
24.35	1	.6	.6	52.2
24.42	1	.6	.6	52.8
24.90	1	.6	.6	53.3
25.09	1	.6	.6	53.9
25.16	1	.6	.6	54.4
25.22	1	.6	.6	55.0
25.25	1	.6	.6	55.6
25.33	1	.6	.6	56.1
25.76	1	.6	.6	56.7
25.77	1	.6	.6	57.2
25.84	1	.6	.6	57.8
25.87	1	.6	.6	58.3
25.89	1	.6	.6	58.9
25.93	1	.6	.6	59.4
26.02	1	.6	.6	60.0
26.26	1	.6	.6	60.6
26.29	1	.6	.6	61.1
26.45	1	.6	.6	61.7
26.62	1	.6	.6	62.2
26.81	1	.6	.6	62.8
27.09	1	.6	.6	63.3
27.33	1	.6	.6	63.9
27.37	1	.6	.6	64.4
27.71	1	.6	.6	65.0
27.72	1	.6	.6	65.6
27.73	1	.6	.6	66.1
28.15	1	.6	.6	66.7
28.64	1	.6	.6	67.2
28.67	1	.6	.6	67.8
28.86	1	.6	.6	68.3
29.73	1	.6	.6	68.9
29.81	1	.6	.6	69.4
29.98	1	.6	.6	70.0
30.09	1	.6	.6	70.6
30.34	1	.6	.6	71.1
30.73	1	.6	.6	71.7
30.89	1	.6	.6	72.2
31.01	1	.6	.6	72.8
31.12	1	.6	.6	73.3
31.20	1	.6	.6	73.9
31.27	1	.6	.6	74.4
31.28	1	.6	.6	75.0
31.30	1	.6	.6	75.6
31.33	1	.6	.6	76.1
31.37	1	.6	.6	76.7
31.40	1	.6	.6	77.2
31.47	1	.6	.6	77.8
31.55	1	.6	.6	78.3
31.61	1	.6	.6	78.9
31.63	1	.6	.6	79.4
31.63	1	.6	.6	80.0
31.64	1	.6	.6	80.6
31.72	1	.6	.6	81.1
31.91	1	.6	.6	81.7
31.97	1	.6	.6	82.2
31.98	1	.6	.6	82.8
32.02	31	17.2	17.2	100.0
Total	180	100.0	100.0