Automatically maintaining wrappers for semi-structured web sources
Data & Knowledge Engineering
ADMA '08 Proceedings of the 4th international conference on Advanced Data Mining and Applications
Method combination for information extraction
Proceedings of the 11th International Conference on Computer Systems and Technologies and Workshop for PhD Students in Computing on International Conference on Computer Systems and Technologies
This paper proposes an ontology-driven, self-adapting approach to information extraction from semi-structured web pages, in which the ontology provides semantic support and plays a central role during the extraction process. It outperforms traditional wrapper systems in adaptiveness and ease of maintenance. First, we build a domain-dependent ontology. Then we design three template-generating algorithms, which are self-adapting and self-maintaining based on the ontology, to perform the web page information extraction. Experimental results show that our prototype system achieves 100% recall and 97.64% precision.