Abstract
The genetic regulatory mechanism heavily influences a substantial portion of biological functions and processes needed to sustain life. For a comprehensive mechanistic understanding of biological processes, it is important to identify the common transcription factor (TF) binding sites (TFBSs) from a set of promoter sequences of co-regulated genes and classify genes that are co-regulated by certain TFs, therefore to provide an insight into the mechanism that underlies the interaction among the co-regulated genes and complicate genetic regulation. We propose a new supervised-weighted discrete kernel clustering (SWDKC) classification method for the identification of TFBS and the classification of gene. Our SWDKC method gave smaller misclassification error rate than the other methods on both the simulated data and the real NF-κB data. We verify that the selected over-represented TFBSs serve informative TFBSs from a biological point of view.
| Original language | English |
|---|---|
| Pages (from-to) | 573-581 |
| Number of pages | 9 |
| Journal | Journal of Applied Statistics |
| Volume | 41 |
| Issue number | 3 |
| DOIs | |
| Publication status | Published - 2014 Mar |
Keywords
- gene classification
- supervised clustering
- support vector machines
- transcription factor
- transcription factor-binding site
ASJC Scopus subject areas
- Statistics and Probability
- Statistics, Probability and Uncertainty
Fingerprint
Dive into the research topics of 'Transcription factor-binding site identification and gene classification via fusion of the supervised-weighted discrete kernel clustering and support vector machine'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS