Robust prostate cancer marker genes emerge from direct integration of inter-study microarray data

Lei Xu, Aik Choon Tan, Daniel Q. Naiman, Donald Geman, Raimond L. Winslow

Research output: Contribution to journalArticlepeer-review

113 Citations (Scopus)


Motivation: DNA microarray data analysis has been used previously to identify marker genes which discriminate cancer from normal samples. However, due to the limited sample size of each study, there are few common markers among different studies of the same cancer. With the rapid accumulation of microarray data, it is of great interest to integrate inter-study microarray data to increase sample size, which could lead to the discovery of more reliable markers. Results: We present a novel, simple method of integrating different microarray datasets to identify marker genes and apply the method to prostate cancer datasets. In this study, by applying a new statistical method, referred to as the top-scoring pair (TSP) classifier, we have identified a pair of robust marker genes (HPN and STAT6) by integrating microarray datasets from three different prostate cancer studies. Cross-platform validation shows that the TSP classifier built from the marker gene pair, which simply compares relative expression values, achieves high accuracy, sensitivity and specificity on independent datasets generated using various array platforms. Our findings suggest a new model for the discovery of marker genes from accumulated microarray data and demonstrate how the great wealth of microarray data can be exploited to increase the power of statistical analysis.

Original languageEnglish
Pages (from-to)3905-3911
Number of pages7
Issue number20
Publication statusPublished - 2005 Oct

Bibliographical note

Funding Information:
This work was supported by NHLBI RO1-HL72488, The Falk Medical Trust and The Whitaker Foundation.

ASJC Scopus subject areas

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics


Dive into the research topics of 'Robust prostate cancer marker genes emerge from direct integration of inter-study microarray data'. Together they form a unique fingerprint.

Cite this