TY - GEN
T1 - Classification of high-resolution NMR spectra based on complex wavelet domain feature selection and kernel-induced random forest
AU - Fan, Guangzhe
AU - Wang, Zhou
AU - Kim, Seoung Bum
AU - Temiyasathit, Chivalai
N1 - Copyright:
Copyright 2011 Elsevier B.V., All rights reserved.
PY - 2010
Y1 - 2010
N2 - High-resolution nuclear magnetic resonance (NMR) spectra contain important biomarkers that have potentials for early diagnosis of disease and subsequent monitoring of its progression. Traditional features extraction and analysis methods have been carried out in the original frequency spectrum domain. In this study, we conduct feature selection based on a complex wavelet transform by making use of its energy shift-insensitive property in a multi-resolution signal decomposition. A false discovery rate based multiple testing procedure is employed to identify important metabolite features. Furthermore, a novel kernel-induced random forest algorithm is used for the classification of NMR spectra based on the selected features. Our experiments with real NMR spectra showed that the proposed method leads to significant reduction in misclassification rate.
AB - High-resolution nuclear magnetic resonance (NMR) spectra contain important biomarkers that have potentials for early diagnosis of disease and subsequent monitoring of its progression. Traditional features extraction and analysis methods have been carried out in the original frequency spectrum domain. In this study, we conduct feature selection based on a complex wavelet transform by making use of its energy shift-insensitive property in a multi-resolution signal decomposition. A false discovery rate based multiple testing procedure is employed to identify important metabolite features. Furthermore, a novel kernel-induced random forest algorithm is used for the classification of NMR spectra based on the selected features. Our experiments with real NMR spectra showed that the proposed method leads to significant reduction in misclassification rate.
KW - Classification tree
KW - Complex wavelet transforms
KW - False discovery rate
KW - High-resolution NMR spectrum
KW - Kernel
KW - Metabolomics
KW - Random forest
UR - http://www.scopus.com/inward/record.url?scp=79956288830&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79956288830&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-13681-8_69
DO - 10.1007/978-3-642-13681-8_69
M3 - Conference contribution
AN - SCOPUS:79956288830
SN - 364213680X
SN - 9783642136801
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 593
EP - 600
BT - Image and Signal Processing - 4th International Conference, ICISP 2010, Proceedings
T2 - 4th International Conference on Image and Signal Processing, ICISP 2010
Y2 - 30 June 2010 through 2 July 2010
ER -