When search results from digital libraries and web resources carry limited metadata, augmenting them with meaningful and stable category information can enable better overviews and support user exploration. This paper proposes six fast-feature techniques that use only the features available in the search result list, such as title, snippet, and URL, to assign results to meaningful categories. The techniques draw on credible knowledge resources, including a US government organizational hierarchy, a thematic hierarchy from the Open Directory Project (ODP) web directory, and personal browse histories, to add valuable metadata to search results. In three tests, the percentage of results categorized for five representative queries was high enough to suggest practical benefits: general web search (76-90%), government web search (39-100%), and the Bureau of Labor Statistics website (48-94%). An additional test submitted 250 TREC queries to a search engine and successfully categorized 66% of the top 100 results using the ODP and 61% of the top 350. The fast-feature techniques have been implemented in a prototype search engine. We propose research directions for improving categorization rates and suggest how web site designers could reorganize their sites to support fast categorization of search results.
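As a rough illustration of categorizing a result from its URL alone, the sketch below looks up each result's host in a small category table and reports the fraction of results that received a category. It is not one of the paper's six techniques as published: the table `HOST_CATEGORIES`, the category labels, and the functions `categorize_by_url` and `categorization_rate` are hypothetical names introduced here, and the table merely stands in for a knowledge resource such as the ODP directory or a government organizational hierarchy.

```python
from urllib.parse import urlparse

# Hypothetical lookup table (an assumption for illustration only); a real
# system would draw these entries from a resource such as the ODP directory.
HOST_CATEGORIES = {
    "bls.gov": "Government/Department of Labor/Bureau of Labor Statistics",
    "census.gov": "Government/Department of Commerce/Census Bureau",
    "python.org": "Computers/Programming/Languages/Python",
}


def categorize_by_url(url):
    """Return a category for the URL's host, or None if no entry matches."""
    host = (urlparse(url).hostname or "").lower()
    labels = host.split(".")
    # Try the full host first, then progressively shorter parent domains,
    # stopping before the bare top-level domain.
    for i in range(len(labels) - 1):
        candidate = ".".join(labels[i:])
        if candidate in HOST_CATEGORIES:
            return HOST_CATEGORIES[candidate]
    return None


def categorization_rate(results):
    """Return the fraction of results that received a category."""
    if not results:
        return 0.0
    hits = sum(1 for r in results if categorize_by_url(r["url"]) is not None)
    return hits / len(results)


# Made-up search results carrying only the fields found in a result list.
results = [
    {"title": "Consumer Price Index", "url": "https://www.bls.gov/cpi/"},
    {"title": "Population data", "url": "https://data.census.gov/"},
    {"title": "Python documentation", "url": "https://docs.python.org/3/"},
    {"title": "Unlisted site", "url": "https://example.com/page"},
]

for r in results:
    print(r["title"], "->", categorize_by_url(r["url"]))
print("categorization rate:", categorization_rate(results))
```

Because the lookup needs nothing beyond the result list itself, no page has to be fetched at query time; that is the sense in which such features are fast. Results whose host is absent from the table stay uncategorized, which is why the share of results categorized is a natural measure to report.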