The nature of statistical learning theory
The nature of statistical learning theory
Making large-scale support vector machine learning practical
Advances in kernel methods
Hierarchical multi-label prediction of gene function
Bioinformatics
A new method to measure the semantic similarity of GO terms
Bioinformatics
Bioinformatics
Prediction of Trans-regulators of Recombination Hotspots in Mouse Genome
BIBM '11 Proceedings of the 2011 IEEE International Conference on Bioinformatics and Biomedicine
Hi-index | 0.00 |
The regulatory mechanism of meiotic recombination hotspots is a fundamental problem in biology, with broad impacts on areas ranging from disease study to evolution. Recently, many genomic and epigenomic features have been associated with recombination hotspots, but none of them can explain hotspots consistently. It is highly desirable to integrate the different features into a predictive model, and study the relation of the features with hotspots and themselves with a systems approach. Moreover, due to rapid and dynamic evolution of recombination hotspots, regulatory mechanisms of hotspots that are evolutionarily conserved among species remain unclear. We propose a machine learning approach that encode genomic and epigenomic features into a support vector machine (SVM). Trained on known hotspots and coldspots in human and mouse genomes, the model is able to predict hotspots based on the features with good performance in both species. Moreover, the model reports a ranking of feature importance, uncovering the interactions of the features with hotspots and themselves. Applying the method to large-scale data, we identified evolutionarily conserved patterns of trans-regulators and feature importance between human and mouse hotspots. This is the first attempt to build a predictive model to identify evolutionarily conserved mechanisms for recombination hotspots by integrating both genomic and epigenomic features.