Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
A comparative study of survival models for breast cancer prognostication based on microarray data: does a single gene beat them all?
222
Zitationen
4
Autoren
2008
Jahr
Abstract
MOTIVATION: Survival prediction of breast cancer (BC) patients independently of treatment, also known as prognostication, is a complex task since clinically similar breast tumors, in addition to be molecularly heterogeneous, may exhibit different clinical outcomes. In recent years, the analysis of gene expression profiles by means of sophisticated data mining tools emerged as a promising technology to bring additional insights into BC biology and to improve the quality of prognostication. The aim of this work is to assess quantitatively the accuracy of prediction obtained with state-of-the-art data analysis techniques for BC microarray data through an independent and thorough framework. RESULTS: Due to the large number of variables, the reduced amount of samples and the high degree of noise, complex prediction methods are highly exposed to performance degradation despite the use of cross-validation techniques. Our analysis shows that the most complex methods are not significantly better than the simplest one, a univariate model relying on a single proliferation gene. This result suggests that proliferation might be the most relevant biological process for BC prognostication and that the loss of interpretability deriving from the use of overcomplex methods may be not sufficiently counterbalanced by an improvement of the quality of prediction. AVAILABILITY: The comparison study is implemented in an R package called survcomp and is available from http://www.ulb.ac.be/di/map/bhaibeka/software/survcomp/.
Ähnliche Arbeiten
Analysis of Relative Gene Expression Data Using Real-Time Quantitative PCR and the 2−ΔΔCT Method
2001 · 179.675 Zit.
Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles
2005 · 55.920 Zit.
<tt>edgeR</tt> : a Bioconductor package for differential expression analysis of digital gene expression data
2009 · 44.036 Zit.
limma powers differential expression analyses for RNA-sequencing and microarray studies
2015 · 42.277 Zit.
clusterProfiler: an R Package for Comparing Biological Themes Among Gene Clusters
2012 · 37.440 Zit.