Information extraction (IE) has emerged as a novel discipline in computer science. In IE, intelligent algorithms are employed to extract the required data and structure them so that they are suitable for querying. Most IE systems use the structure of a web page, e.g. its HTML tags, to recognize the target information. In this article, an algorithm is developed to recognize the main region of a web page containing the target information, by means of an ontology, the web-page structure, and the goodness-of-fit χ² test. After the main region is recognized, the records in that region are identified, and each record is written to a text file.
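The abstract does not say how the goodness-of-fit test is applied, so the following is only a minimal sketch under stated assumptions, not the authors' algorithm. It assumes each candidate region of a page has already been reduced to counts of matches per ontology concept, and that the ontology supplies an expected proportion for each concept. The region whose counts deviate least from those proportions (smallest Pearson χ² statistic) is taken as the main region. The names `chi_square_statistic`, `pick_main_region`, the concept list, and all numbers are hypothetical.

```python
def chi_square_statistic(observed, expected_probs):
    """Pearson goodness-of-fit statistic: sum((O - E)^2 / E).

    observed       -- counts of matches per ontology concept in one region
    expected_probs -- assumed expected proportion of each concept (all > 0)
    """
    total = sum(observed)
    stat = 0.0
    for o, p in zip(observed, expected_probs):
        e = total * p  # expected count for this concept
        stat += (o - e) ** 2 / e
    return stat


def pick_main_region(regions, expected_probs):
    """Return the name of the region that best fits the expected proportions.

    Assumption: a smaller statistic means a better fit. Regions with no
    ontology matches at all are skipped, since no test can be computed.
    """
    best_name, best_stat = None, None
    for name, observed in regions.items():
        if sum(observed) == 0:
            continue
        stat = chi_square_statistic(observed, expected_probs)
        if best_stat is None or stat < best_stat:
            best_name, best_stat = name, stat
    return best_name


# Hypothetical concepts: title, price, author.
expected = [0.5, 0.25, 0.25]
regions = {
    "sidebar": [4, 0, 0],
    "main": [10, 5, 5],
    "footer": [0, 0, 0],
}
main_region = pick_main_region(regions, expected)
```

In a fuller treatment the statistic would be compared against a χ² critical value for the appropriate degrees of freedom (number of concepts minus one) rather than only ranked across regions; that step is omitted here to keep the sketch free of external dependencies.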