Survival models with gene groups as covariates

dc.contributor.advisorRahnenführer, Jörg
dc.contributor.authorKammers, Kai
dc.contributor.refereeIckstadt, Katja
dc.date.accepted2012-06-26
dc.date.accessioned2012-06-28T12:18:59Z
dc.date.available2012-06-28T12:18:59Z
dc.date.issued2012-06-28
dc.description.abstractAn important application of high-dimensional gene expression measurements is the risk prediction and the interpretation of the variables in the resulting survival models. A major problem in this context is the typically large number of genes compared to the number of observations (individuals). Feature selection procedures can generate predictive models with high prediction accuracy and at the same time low model complexity. However, interpretability of the resulting models is still limited due to little knowledge on many of the remaining selected genes. Thus, we summarize genes as gene groups defined by the hierarchically structured Gene Ontology (GO) and include these gene groups as covariates in the hazard regression models. Since expression profiles within GO groups are often heterogeneous, we present a new method to obtain subgroups with coherent patterns. We apply preclustering to genes within GO groups according to the correlation of their gene expression measurements. We compare Cox models for modeling disease free survival times of breast cancer patients. Besides classical clinical covariates we consider genes, GO groups and preclustered GO groups as additional genomic covariates. Survival models with preclustered gene groups as covariates have improved prediction accuracy in long term survival compared to models built only with single genes or GO groups. We also provide an analysis of frequently chosen covariates and comparisons to models using only clinical information. The preclustering information enables a more detailed analysis of the biological meaning of covariates selected in the final models. Compared to models built only with single genes there is additional functional information contained in the GO annotation, and compared to models using GO groups as covariates the preclustering yields coherent representative gene expression profiles. For evaluation of fitted survival models, we present prediction error curves revealing that models with preclustered gene groups have improved prediction performance compared to models built with single genes or GO groups.en
dc.identifier.urihttp://hdl.handle.net/2003/29489
dc.identifier.urihttp://dx.doi.org/10.17877/DE290R-4815
dc.language.isoende
dc.subject.ddc310
dc.subject.rswkHumangenetik ; Genomik; Brustkrebs; Biostatistikde
dc.titleSurvival models with gene groups as covariatesen
dc.typeTextde
dc.type.publicationtypedoctoralThesisde
dcterms.accessRightsopen access

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Dissertation.pdf
Size:
964.77 KB
Format:
Adobe Portable Document Format
Description:
DNB
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.85 KB
Format:
Item-specific license agreed upon to submission
Description: