visualising semantic spaces and author co-citation networks in digital libraries
Information Processing and Management: an International Journal - Special issue on progress toward digital libraries
Mining frequent patterns without candidate generation
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Characterizing and Mining the Citation Graph of the Computer Science Literature
Knowledge and Information Systems
Applying passage in Web text mining
International Journal of Intelligent Systems - Intelligent Technologies
Quality Evaluation for Document Relation Discovery Using Citation Information
IEICE - Transactions on Information and Systems
Hi-index | 0.00 |
Word-based relations among technical documents are immensely useful information but often hidden in a large amount of scientific publications. This work presents a method to apply latent semantic indexing in frequent itemset mining to discover potential relations among scientific publications. In this work, two weighting schemes, tf and tfidf are investigated with the exploitation of latent semantic indexing. The proposed method is evaluated using a set of technical documents in a publication database by comparing the extracted document relations with their references (citations). To this end, the paper uses order accumulative citation matrices to evaluate the validity (quality) of discovered patterns. The results also show that the proposed method successfully discovers a set of document relations, comparing to the original method that uses no latent semantic indexing.