Large-scale collections of web pages have been essential for research in information retrieval and related areas. This paper gives an overview of a large web collection used in the SPIRIT project to design and test spatially-aware retrieval systems. Several statistics are derived and presented to characterize the collection.