Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Projections for efficient document clustering
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Probabilistic Visual Learning for Object Representation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Text Classification from Labeled and Unlabeled Documents using EM
Machine Learning - Special issue on information retrieval
Information Retrieval
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Document Ranking and the Vector-Space Model
IEEE Software
Learning a Language-Independent Representation for Terms from a Partially Aligned Corpus
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Beyond Eigenfaces: Probabilistic Matching for Face Recognition
FG '98 Proceedings of the 3rd. International Conference on Face & Gesture Recognition
Hierarchical Bayesian clustering for automatic text classification
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Athena: text mining based discovery of scientific workflows in disperse repositories
RED'10 Proceedings of the Third international conference on Resource Discovery
Hi-index | 0.00 |
We have developed an effective probabilistic classifier for document classification by introducing the concept of the differential document vectors and DLSI (differential latent semantics index) spaces. A simple posteriori calculation using the intra- and extra-document statistics demonstrates the advantage of the DLSI space-based probabilistic classifier over the popularly used LSI space-based classifier in classification performance.