Bioinformatics
Normalized mutual information feature selection
IEEE Transactions on Neural Networks
Bioinformatics
Computational Biology and Chemistry
Hi-index | 0.00 |
Multiple sequence alignment (MSA) is an essential approach to apply in other outstanding bioinformatics tasks such as structural predictions, biological function analyses or phylogenetic modeling. However, current MSA methodologies do not reach a consensus about how sequences must be accurately aligned. Moreover, these tools usually provide partially optimal alignments, as each one is focused on specific features. Thus, the same set of sequences can provide quite different alignments, overall when sequences are less related. Consequently, researchers and biologists do not agree on how the quality of MSAs should be evaluated in order to decide the most adequate methodology. Therefore, recent evaluations tend to use more complex scores including supplementary biological features. In this work, we address the evaluation of MSAs by using a novel supervised learning approach based on Least Square Support Vector Machine (LS-SVM). This algorithm will include a set of heterogeneous features and scores in order to determine the alignment accuracies. It is assessed by means of the benchmark BAliBASE.