Refined approximations to permutation tests for multivariate inference
Computational Statistics & Data Analysis
A Hidden Markov Model for Predicting Transmembrane Helices in Protein Sequences
ISMB '98 Proceedings of the 6th International Conference on Intelligent Systems for Molecular Biology
F-score with Pareto Front Analysis for Multiclass Gene Selection
EvoBIO '09 Proceedings of the 7th European Conference on Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics
ICIC'06 Proceedings of the 2006 international conference on Computational Intelligence and Bioinformatics - Volume Part III
Multiclass Gene Selection Using Pareto-Fronts
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Pacc - a discriminative and accuracy correlated measure for assessment of classification results
MLDM'13 Proceedings of the 9th international conference on Machine Learning and Data Mining in Pattern Recognition
Hi-index | 0.00 |
Predicted assignments of biological sequences are often evaluated by Matthews correlation coefficient. However, Matthews correlation coefficient applies only to cases where the assignments belong to two categories, and cases with more than two categories are often artificially forced into two categories by considering what belongs and what does not belong to one of the categories, leading to the loss of information. Here, an extended correlation coefficient that applies to K-categories is proposed, and this measure is shown to be highly applicable for evaluating prediction of RNA secondary structure in cases where some predicted pairs go into the category ''unknown'' due to lack of reliability in predicted pairs or unpaired residues. Hence, predicting base pairs of RNA secondary structure can be a three-category problem. The measure is further shown to be well in agreement with existing performance measures used for ranking protein secondary structure predictions. Server and software is available at http://rk.kvl.dk/