Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Similarity Search in High Dimensions via Hashing
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
An algorithm for unsupervised topic discovery from broadcast news stories
HLT '02 Proceedings of the second international conference on Human Language Technology Research
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Building a hierarchy of events and topics for newspaper digital libraries
ECIR'03 Proceedings of the 25th European conference on IR research
Hi-index | 0.00 |
We propose an unsupervised method for propagating automatically extracted fine-grained topic labels among news items to improve their topic description for subsequent text classification procedure. This method compares vector representations of news items and assigns to each news item the label of its closest neighbour with a different topic label. Results obtained show that high precision can be achieved in propagating the top ranked topic label, and that 2-gram and 3-gram feature representations optimize the precision.