Abstract
Large-scale transcriptome profiling in clinical studies often involves assaying multiple samples of a patient to monitor disease progression, treatment effect, and host response in multiple tissues. Such profiling is prone to human error, which often results in mislabeled samples. Here, we present a method to detect mislabeled sample outliers using coding single nucleotide polymorphisms (cSNPs) specifically designed on the microarray and demonstrate that the mislabeled samples can be efficiently identified by either simple clustering of allele-specific expression scores or Mahalanobis distance-based outlier detection method. Based on our results, we recommend the incorporation of cSNPs into future transcriptome array designs as intrinsic markers for sample tracking.
Original language | English |
---|---|
Pages (from-to) | 386-387 |
Number of pages | 2 |
Journal | BioTechniques |
Volume | 52 |
Issue number | 6 |
DOIs | |
Publication status | Published - 2012 Jun |
Externally published | Yes |
Keywords
- Coding SNP
- Microarray
- Outlier detection
- Sample tracking
- Transcriptome profiling
ASJC Scopus subject areas
- Biotechnology
- General Biochemistry,Genetics and Molecular Biology