week4_4choosing_predictors

methods of predictor selection

original model
- blood pressure (HR=1.30, p=0.0002)
- cholesterol (HR=1.05, p=0.155)
cholesterol removed
- blood pressure (HR=1.50)
- you judge that HR 1.3 to 1.5 is big enough change (more below)
add cholesterol back
- blood pressure HR back to 1.30
conclude there’s correlation between blood pressure and cholesterol, need to keep both in model
(reason why stepwise procedure is so unreliable, {will miss these correlations})
how to judge if a variable’s HR change was big enough to warrant adding predictors back?
- arbitrary
- usually HR change > 0.05 is viewed as big enough
- e.g. HR 1.30 -> 1.34 not enough
- depends on how results is going to be used:
  - people invited into a national screening programme, based on their estimated risk of developing disease,
    - using coefficient of 1.30 instead of 1.50 greatly affects number of people invited
  - epidemiological study of risk factors
    - 1.30 and 1.50 not that different,
    - all we do in finding risk factors is to say “these ARE significant risk factors, these are not”, HR is secondary importance {p-value more important}