The Journal of Machine Learning Research
LIBSVM: A library for support vector machines
ACM Transactions on Intelligent Systems and Technology (TIST)
Hi-index | 0.00 |
The type III secretion system (T3SS) is a specialized protein delivery system that plays a key role in pathogenic bacteria. Until now, the secretion mechanism has not been fully understood yet. Recently, a lot of emphasis has been put on identifying type III secreted effectors (T3SE) in order to uncover the signal and principle that guide the secretion process. However, the amino acid sequences of T3SEs have great sequence diversity through fast evolution and many T3SEs have no homolog in the public databases at all. Therefore, it is notoriously challenging to recognize T3SEs. In this paper, we use amino acid sequence features to predict T3SEs, and conduct feature reduction using a topic model. The experimental results on Pseudomonas syringae data set demonstrate that the proposed method can effectively reduce the features and improve the prediction accuracy at the same time.