We present a generic approach for semantic-based classification of text documents into predefined categories. The technique is applied to patent analytics, where it classifies a collection of patent documents into one or more nodes of a user-defined taxonomy. The approach is a multi-step process consisting of noun extraction, word sense disambiguation, computation of semantic relatedness between pairs of words using WordNet, and confidence score computation. The algorithm achieved good accuracy on an experimental dataset and can be readily adapted and customized to domains other than the patent landscape analysis domain discussed in this paper.
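
The four steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. Everything in it is an assumption made for illustration: a tiny hand-built hypernym tree (`HYPERNYM`) stands in for WordNet, a lookup in a small sense table (`SENSES`) stands in for real noun extraction, relatedness is a simple path-based measure (1 / (1 + shortest path length)), and the confidence score is taken to be the mean of each document noun's best relatedness to a category's keywords. The taxonomy, the function names, and the 0.5 threshold are likewise hypothetical.

```python
import re

# Hypothetical toy hypernym tree (child -> parent), standing in for WordNet.
HYPERNYM = {
    "entity": None,
    "device": "entity",
    "engine": "device",
    "power_source": "device",
    "battery": "power_source",
    "cell.electric": "power_source",
    "biological_thing": "entity",
    "drug": "biological_thing",
    "protein": "biological_thing",
    "cell.biology": "biological_thing",
}

# Hypothetical sense inventory: surface noun -> candidate sense nodes.
SENSES = {
    "engine": ["engine"],
    "battery": ["battery"],
    "cell": ["cell.electric", "cell.biology"],
    "drug": ["drug"],
    "protein": ["protein"],
}

# Hypothetical user-defined taxonomy: category -> keyword sense nodes.
TAXONOMY = {
    "energy_storage": ["battery", "power_source"],
    "biotech": ["protein", "drug"],
}


def extract_nouns(text):
    """Step 1 (stand-in): keep tokens found in the toy sense table."""
    return [tok for tok in re.findall(r"[a-z]+", text.lower()) if tok in SENSES]


def ancestors(node):
    """Map each node on the path from `node` to the root to its distance."""
    depth, d = {}, 0
    while node is not None:
        depth[node] = d
        node = HYPERNYM[node]
        d += 1
    return depth


def path_similarity(a, b):
    """Step 3: path-based relatedness, 1 / (1 + shortest path length)."""
    up_a, up_b = ancestors(a), ancestors(b)
    dist = min(up_a[n] + up_b[n] for n in set(up_a) & set(up_b))
    return 1.0 / (1.0 + dist)


def disambiguate(nouns):
    """Step 2: pick, per noun, the sense best supported by the other nouns."""
    chosen = []
    for i, word in enumerate(nouns):
        context = [w for j, w in enumerate(nouns) if j != i]

        def support(sense):
            return sum(
                max(path_similarity(sense, s) for s in SENSES[c]) for c in context
            )

        chosen.append(max(SENSES[word], key=support))
    return chosen


def confidence(doc_senses, category_senses):
    """Step 4: mean of each document sense's best match in the category."""
    if not doc_senses:
        return 0.0
    best = [max(path_similarity(d, c) for c in category_senses) for d in doc_senses]
    return sum(best) / len(best)


def classify(text, taxonomy, threshold=0.5):
    """Return every category whose confidence reaches the threshold."""
    senses = disambiguate(extract_nouns(text))
    scores = {cat: confidence(senses, kws) for cat, kws in taxonomy.items()}
    return {cat: score for cat, score in scores.items() if score >= threshold}
```

For example, `classify("The battery cell powers the engine.", TAXONOMY)` resolves "cell" to its electrical sense and assigns the text only to `energy_storage`. Returning a dictionary of all categories above the threshold, rather than a single best label, mirrors the abstract's "one or more nodes" requirement.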