Sibling page search by page examples

Authors:
Hiroaki Ohshima;Satoshi Oyama;Katsumi Tanaka
Affiliations:
Department of Social Informatics, Graduate School of Informatics, Kyoto University, Kyoto, Japan;Department of Social Informatics, Graduate School of Informatics, Kyoto University, Kyoto, Japan;Department of Social Informatics, Graduate School of Informatics, Kyoto University, Kyoto, Japan
Venue:
ICADL'06 Proceedings of the 9th international conference on Asian Digital Libraries: achievements, Challenges and Opportunities
Year:
2006

Citing 4
Cited 3

Text Information Retrieval Systems

Text Information Retrieval Systems
Novelty and redundancy detection in adaptive filtering

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval
MindReader: Querying Databases Through Multiple Examples

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases

A query-free retrieving method based on content elements' order for multimedia news archives

ICADL'07 Proceedings of the 10th international conference on Asian digital libraries: looking back 10 years and forging new frontiers
Searching the web for alternative answers to questions on WebQA sites

WAIM'10 Proceedings of the 11th international conference on Web-age information management
Search for minority information from wikipedia based on similarity of majority information

APWeb'12 Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose methods of searching Web pages that are “semantically” regarded as “siblings” with respect to given page examples. That is, our approach aims to find pages that are similar in theme but have different content from the given sample pages. We called this “sibling page search”. The proposed search methods are different from conventional content-based similarity search for Web pages. Our approach recommends Web pages whose “conceptual” classification category is the same as that of the given sample pages, but whose content is different from the sample pages. In this sense, our approach will be useful for supporting a user's opportunistic search, meaning a search in which the user's interest and intention are not fixed. The proposed methods were implemented by computing the “common” and “unique” feature vectors of the given sample pages, and by comparing those feature vectors with each retrieved page. We evaluated our method for sibling page search, in which our method was applied to test sets consisting of page collections from the Open Directory Project (ODP).