Many problems arise when integrating information from multiple sources on the web. One of them is that data instances can appear in inconsistent formats across sources. An example application is integrating the reviews of Los Angeles restaurants from Yahoo's Restaurants webpage with the current health rating for each restaurant from the LA County Department of Health's website. Integrating these sources requires determining whether they share any restaurants by comparing the data instances from both sources (Figure 1). Because the instances can be in different formats, e.g., the restaurant "Jerry's Famous Deli" on Yahoo's webpage can appear as "Jerry's Famous Delicatessen" in the Department of Health's source, they cannot be compared using equality and must instead be judged according to similarity.
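
The contrast between equality and similarity can be made concrete with a minimal sketch. It is only an illustration and not the method of the paper: the character-based score from Python's standard `difflib` module, the helper names `normalize`, `similarity`, and `same_restaurant`, and the 0.8 threshold are all assumptions chosen for the example.

```python
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lowercase and collapse whitespace so trivial differences do not count."""
    return " ".join(name.lower().split())


def similarity(a: str, b: str) -> float:
    """Return a score in [0, 1]; 1.0 means the normalized strings are identical."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def same_restaurant(a: str, b: str, threshold: float = 0.8) -> bool:
    """Hypothetical decision rule: call two names a match at or above the threshold.

    The threshold is an assumed value for illustration, not one from the source.
    """
    return similarity(a, b) >= threshold


# The two spellings of the same restaurant quoted in the text.
yahoo_name = "Jerry's Famous Deli"
health_name = "Jerry's Famous Delicatessen"

# Strict equality misses the match; the similarity rule accepts it.
exact_match = yahoo_name == health_name
similar_match = same_restaurant(yahoo_name, health_name)
```

Under these assumptions the equality test rejects the pair while the similarity rule accepts it. A fixed, hand-picked threshold on a single string score is the simplest possible rule; real sources usually need several attributes (name, address, phone) compared together.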