Combining Information Extraction Systems Using Voting and Stacked Generalization
The Journal of Machine Learning Research
WRAPPER INFERENCE FOR AMBIGUOUS WEB PAGES
Applied Artificial Intelligence
Using semantics to identify web objects
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
The Web has established itself as the dominant medium for doing electronic commerce. Consequently, the number of service providers, both large and small, advertising their services on the web continues to proliferate. In this paper we describe new extraction algorithms for mining service directories from web pages. We develop a novel propagation technique for identifying and accumulating all of the attributes related to a service entity in a web page. We provide experimental results of the effectiveness of our extraction techniques by mining a database of veterinarian service providers from web sources.