Modern Information Retrieval
Robust automated topic identification
Robust automated topic identification
Knowledge-based automatic topic identification
ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
RCV1: A New Benchmark Collection for Text Categorization Research
The Journal of Machine Learning Research
Discovering missing links in Wikipedia
Proceedings of the 3rd international workshop on Link discovery
Proceedings of the 15th international conference on World Wide Web
The problem of ontology alignment on the web: a first report
WAC '06 Proceedings of the 2nd International Workshop on Web as Corpus
NLDB'05 Proceedings of the 10th international conference on Natural Language Processing and Information Systems
Automatic assignment of wikipedia encyclopedic entries to wordnet synsets
AWIC'05 Proceedings of the Third international conference on Advances in Web Intelligence
Measuring article quality in wikipedia: models and evaluation
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
On ranking controversies in wikipedia: models and evaluation
WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Proceedings of the 17th international conference on World Wide Web
Overview of INEX 2007 Link the Wiki Track
Focused Access to XML Documents
What's in Wikipedia?: mapping topics and conflict using socially annotated category structure
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Understanding user's query intent with wikipedia
Proceedings of the 18th international conference on World wide web
Web Search Clustering and Labeling with Hidden Topics
ACM Transactions on Asian Language Information Processing (TALIP)
Enhancing cluster labeling using wikipedia
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
International Journal of Human-Computer Studies
Document word clouds: visualising web documents as tag clouds to aid users in relevance decisions
ECDL'09 Proceedings of the 13th European conference on Research and advanced technology for digital libraries
Using Wikipedia categories for compact representations of chemical documents
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Query classification using Wikipedia
International Journal of Intelligent Information and Database Systems
Proceedings of the 22nd ACM conference on Hypertext and hypermedia
Harvesting Wikipedia Knowledge to Identify Topics in Ongoing Natural Language Dialogs
WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Advertising Keywords Recommendation for Short-Text Web Pages Using Wikipedia
ACM Transactions on Intelligent Systems and Technology (TIST)
TODWEB: training-less ontology based deep web source classification
Proceedings of the 13th International Conference on Information Integration and Web-based Applications and Services
Learning local content shift detectors from document-level information
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Topic mining based on graph local clustering
MICAI'11 Proceedings of the 10th international conference on Artificial Intelligence: advances in Soft Computing - Volume Part II
Mining interests for user profiling in electronic conversations
Expert Systems with Applications: An International Journal
Making your interests follow you on twitter
Proceedings of the 21st ACM international conference on Information and knowledge management
Learning multilingual named entity recognition from Wikipedia
Artificial Intelligence
Building Multi-Modal Relational Graphs for Multimedia Retrieval
International Journal of Multimedia Data Engineering & Management
Understanding the top grass roots in sina-weibo
IScIDE'12 Proceedings of the third Sino-foreign-interchange conference on Intelligent Science and Intelligent Data Engineering
Recognition of word collocation habits using frequency rank ratio and inter-term intimacy
Expert Systems with Applications: An International Journal
Extracting semantic knowledge from Wikipedia category names
Proceedings of the 2013 workshop on Automated knowledge base construction
Exploiting topic tracking in real-time tweet streams
Proceedings of the 2013 international workshop on Mining unstructured big data using natural language processing
Relational term-suggestion graphs incorporating multipartite concept and expertise networks
ACM Transactions on Intelligent Systems and Technology (TIST) - Special Section on Intelligent Mobile Knowledge Discovery and Management Systems and Special Issue on Social Web Mining
Hi-index | 0.00 |
In the last few years the size and coverage of Wikipe- dia, a freely available on-line encyclopedia has reached the point where it can be utilized similar to an ontology or tax- onomy to identify the topics discussed in a document. In this paper we will show that even a simple algorithm that exploits only the titles and categories of Wikipedia articles can characterize documents by Wikipedia categories sur- prisingly well. We test the reliability of our method by pre- dicting categories ofWikipedia articles themselves based on their bodies, and by performing classification and cluster- ing on 20 Newsgroups and RCV1, representing documents by their Wikipedia categories instead of their texts.