A common theory of information fusion from multiple text sources step one: cross-document structure
SIGDIAL '00 Proceedings of the 1st SIGdial workshop on Discourse and dialogue - Volume 10
A diachronic analysis of gender-related web communities using a HITS-Based mining tool
APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
Hi-index | 0.00 |
This paper addresses a retrieval method for BBS(Bulletin Board System) articles with relevance index between the retrieval query and an article. Simply using the keyword-based retrieval has limitation on narrowing the articles, because most BBS articles include various keywords and such combination of some unrelated keywords to the retrieval query causes unexpected results. On the other hand, most BBSs have a characteristic structure, so-called "thread", which consists of one question article and a set of answer articles. Based on this structure, our method calculates the relevance index of each part of an article with association index among words derived from the Internet search engine results. We applied it to a practical word-of-mouth BBS and compared with the retrieval method of cosine similarity index in the word-vector space. The results show that our method had 30% better retrieval accuracy.