The Journal of Machine Learning Research
Suggesting novel but related topics: towards context-based support for knowledge model extension
Proceedings of the 10th international conference on Intelligent user interfaces
Discovering missing links in Wikipedia
Proceedings of the 3rd international workshop on Link discovery
Wikify!: linking documents to encyclopedic knowledge
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Introduction to Information Retrieval
Introduction to Information Retrieval
Learning to link with wikipedia
Proceedings of the 17th ACM conference on Information and knowledge management
Get out the vote: determining support or opposition from congressional floor-debate transcripts
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Coherent keyphrase extraction via web mining
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Completing wikipedia's hyperlink structure through dimensionality reduction
Proceedings of the 18th ACM conference on Information and knowledge management
Proceedings of the 2010 ACM conference on Computer supported cooperative work
Semi-automatic construction of topic ontologies
EWMF'05/KDO'05 Proceedings of the 2005 joint international conference on Semantics, Web and Mining
Supporting software language engineering by automated domain knowledge acquisition
MODELS'11 Proceedings of the 2011th international conference on Models in Software Engineering
Exploiting potential citation papers in scholarly paper recommendation
Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries
Can back-of-the-book indexes be automatically created?
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Automated construction of a large semantic network of related terms for domain-specific modeling
CAiSE'13 Proceedings of the 25th international conference on Advanced Information Systems Engineering
Hi-index | 0.02 |
We present a method for automated topic suggestion. Given a plain-text input document, our algorithm produces a ranking of novel topics that could enrich the input document in a meaningful way. It can thus be used to assist human authors, who often fail to identify important topics relevant to the context of the documents they are writing. Our approach marries two algorithms originally designed for linking documents to Wikipedia articles, proposed by Milne and Witten [15] and West et al. [22]. While neither of them can suggest novel topics by itself, their combination does have this capability. The key step towards finding missing topics consists in generalizing from a large background corpus using principal component analysis. In a quantitative evaluation we conclude that our method achieves the precision of human editors when input documents are Wikipedia articles, and we complement this result with a qualitative analysis showing that the approach also works well on other types of input documents.