Introduction to statistical pattern recognition (2nd ed.)
Introduction to statistical pattern recognition (2nd ed.)
Information retrieval: data structures and algorithms
Information retrieval: data structures and algorithms
Information storage and retrieval
Information storage and retrieval
Self-organizing maps
Foundations of statistical natural language processing
Foundations of statistical natural language processing
Neural Networks: A Comprehensive Foundation
Neural Networks: A Comprehensive Foundation
Modern Information Retrieval
Text Retrieval Using Self-Organized Document Maps
Neural Processing Letters
Order statistics learning vector quantizer
IEEE Transactions on Image Processing
Self organization of a massive document collection
IEEE Transactions on Neural Networks
Adaptive topological tree structure for document organisation and visualisation
Neural Networks - 2004 Special issue: New developments in self-organizing systems
Document classification system based on HMM word map
CSTST '08 Proceedings of the 5th international conference on Soft computing as transdisciplinary science and technology
A granular computing framework for self-organizing maps
Neurocomputing
Expert Systems with Applications: An International Journal
Multilayer SOM with tree-structured data for efficient document retrieval and plagiarism detection
IEEE Transactions on Neural Networks
A coarse-to-fine framework to efficiently thwart plagiarism
Pattern Recognition
Hi-index | 0.00 |
The self-organizing map algorithm has been used successfully in document organization. We now propose using the same algorithm for document retrieval. Moreover, we test the performance of the self-organizing map by replacing the linear Least Mean Squares adaptation rule with the marginal median. We present two implementations of the latter variant of the self-organizing map by either quantizing the real valued feature vectors to integer valued ones or not. Experiments performed using both implementations demonstrate a superior performance against the self-organizing map based method in terms of the number of training iterations needed so that the mean square error (i.e. the average distortion) drops to the e-1 = 36.788% of its initial value. Furthermore, the performance of a document organization and retrieval system employing the self-organizing map architecture and its variant is assessed using the average recall-precision curves evaluated on two corpora; the first comprises of manually selected web pages over the Internet having touristic content and the second one is the Reuters- 21578, Distribution 1.0.