Multi-resolution Boosting for Classification and Regression Problems
PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Nucleosome occupancy information improves de novo motif discovery
RECOMB'07 Proceedings of the 11th annual international conference on Research in computational molecular biology
Computational Statistics & Data Analysis
Computational molecular biology of genome expression and regulation
PReMI'05 Proceedings of the First international conference on Pattern Recognition and Machine Intelligence
Functional characterization of drug-protein interactions network
Intelligent Data Analysis
Hi-index | 3.84 |
Motivation: Building an accurate binding model for a transcription factor (TF) is essential to differentiate its true binding targets from those spurious ones. This is an important step toward understanding gene regulation. Results: This paper describes a boosting approach to modeling TF--DNA binding. Different from the widely used weight matrix model, which predicts TF--DNA binding based on a linear combination of position-specific contributions, our approach builds a TF binding classifier by combining a set of weight matrix based classifiers, thus yielding a non-linear binding decision rule. The proposed approach was applied to the ChIP-chip data of Saccharomyces cerevisiae. When compared with the weight matrix method, our new approach showed significant improvements on the specificity in a majority of cases. Contact: wwong@hsph.harvard.edu Supplementary information: The software and the Supplementary data are available at http://biogibbs.stanford.edu/~hong2004/MotifBooster/.