We present an approach for learning part-of-speech distinctions by induction over the lexicon of the Cyc knowledge base. A decision tree that incorporates both semantic and syntactic features produces good results (74.6%). More accurate results (90.5%) are achieved for the special case of deciding whether lexical mappings should use count-noun or mass-noun headwords. Comparable results are also obtained using OpenCyc, the publicly available version of Cyc.
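To make the count-noun versus mass-noun decision concrete, the following is a minimal sketch of an ID3-style decision tree that splits on information gain. It is not the learner, feature set, or data used in the work described above: the feature names (`semantic_type`, `takes_plural`), their values, and the six training rows are hypothetical and chosen only to show one semantic and one syntactic feature feeding the same tree.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def best_feature(rows, labels, features):
    """Return the feature whose split yields the highest information gain."""
    def gain(feature):
        remainder = 0.0
        for value in {row[feature] for row in rows}:
            subset = [lab for row, lab in zip(rows, labels) if row[feature] == value]
            remainder += len(subset) / len(labels) * entropy(subset)
        return entropy(labels) - remainder
    return max(features, key=gain)


def build_tree(rows, labels, features):
    """Recursively build a tree; leaves are labels, inner nodes are dicts."""
    majority = Counter(labels).most_common(1)[0][0]
    if len(set(labels)) == 1 or not features:
        return majority
    feature = best_feature(rows, labels, features)
    remaining = [f for f in features if f != feature]
    branches = {}
    for value in {row[feature] for row in rows}:
        idx = [i for i, row in enumerate(rows) if row[feature] == value]
        branches[value] = build_tree(
            [rows[i] for i in idx], [labels[i] for i in idx], remaining
        )
    return {"feature": feature, "branches": branches, "default": majority}


def classify(tree, row):
    """Walk the tree; fall back to the node's majority label on unseen values."""
    while isinstance(tree, dict):
        tree = tree["branches"].get(row[tree["feature"]], tree["default"])
    return tree


# Hypothetical toy data: one semantic and one syntactic feature per entry.
ROWS = [
    {"semantic_type": "Substance", "takes_plural": "no"},
    {"semantic_type": "Substance", "takes_plural": "no"},
    {"semantic_type": "Artifact", "takes_plural": "yes"},
    {"semantic_type": "Artifact", "takes_plural": "yes"},
    {"semantic_type": "Substance", "takes_plural": "yes"},
    {"semantic_type": "Event", "takes_plural": "no"},
]
LABELS = ["mass", "mass", "count", "count", "count", "mass"]

tree = build_tree(ROWS, LABELS, ["semantic_type", "takes_plural"])
print(classify(tree, {"semantic_type": "Substance", "takes_plural": "no"}))
```

On this toy data the syntactic feature separates the two classes completely, so the tree splits on it first. With a real lexicon the semantic features would presumably carry more of the decision, and a learner with pruning and handling for missing values would be a more realistic choice than this bare sketch.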