Using the structure of documents to improve the discovery of unexpected information
Proceedings of the 2006 ACM symposium on Applied computing
Extracting Advantage Phrases That Hint at a New Technology's Potentials
PAKM '08 Proceedings of the 7th International Conference on Practical Aspects of Knowledge Management
Discovering unexpected documents in corpora
Knowledge-Based Systems
The purpose of technology watch is to gather, process and integrate the scientific and technical information that is useful to economic players. In this article, we propose to use text mining techniques to automate the processing of data found in scientific text databases. Technology watch poses an unusual difficulty compared with conventional applications of text mining: the target is not frequent knowledge hidden in the texts but unexpected knowledge. As a result, the usual measures used for knowledge discovery have to be revised. For that purpose, we have developed the UnexpectedMiner system, which uses new measures to estimate the unexpectedness of a document. Our system is evaluated on a database of articles relating to the field of machine learning.