figshare
Browse
1/1
2 files

On Estimation of the Hazard Function From Population-Based Case–Control Studies

Version 2 2018-06-12, 17:49
Version 1 2017-08-07, 19:27
dataset
posted on 2018-06-12, 17:49 authored by Li Hsu, Malka Gorfine, David Zucker

The population-based case–control study design has been widely used for studying the etiology of chronic diseases. It is well established that the Cox proportional hazards model can be adapted to the case–control study and hazard ratios can be estimated by (conditional) logistic regression model with time as either a matched set or a covariate. However, the baseline hazard function, a critical component in absolute risk assessment, is unidentifiable, because the ratio of cases and controls is controlled by the investigators and does not reflect the true disease incidence rate in the population. In this article, we propose a simple and innovative approach, which makes use of routinely collected family history information, to estimate the baseline hazard function for any logistic regression model that is fit to the risk factor data collected on cases and controls. We establish that the proposed baseline hazard function estimator is consistent and asymptotically normal and show via simulation that it performs well in finite samples. We illustrate the proposed method by a population-based case–control study of prostate cancer where the association of various risk factors is assessed and the family history information is used to estimate the baseline hazard function. Supplementary materials for this article are available online.

Funding

The authors gratefully acknowledge National Institutes of Health grants P01CA53996, R01CA189532, and R01CA195789.

History

Usage metrics

    Journal of the American Statistical Association

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC