C4.5: programs for machine learning
C4.5: programs for machine learning
Self-organizing maps
Machine Learning
Hi-index | 0.00 |
In this paper, we present a system for marking up text documents into XML on a Self-Organising Map (SOM). The system organises pre-tagged XML documents on the Self-Organising Map such that the documents similar in content are placed closer to each other. Then, by employing the inductive learning algorithm C5.0, the system learns markup rules from the nearest SOM neighbours of a new unmarked document. Experiments with the system on a number of document corpora demonstrate that our approach is promising.