Template extraction from candidate template set generation: a structure and content approach
Proceedings of the 43rd annual Southeast regional conference - Volume 2
DOM semantic expansion-based extraction of topical information from web pages
WISM'11 Proceedings of the 2011 international conference on Web information systems and mining - Volume Part II
CCWrapper: adaptive predefined schema guided web extraction
WAIM '06 Proceedings of the 7th international conference on Advances in Web-Age Information Management
Hi-index | 0.00 |
As WWW becomes more and more popular and powerful, how to search information on the web in database way becomes an important research topic. COMMIX, which is developed in the DB group in Peking University (China), is a system towards building very large database using data from the Web for information extraction, integration and query answering. COMMIX has some innovative features, such as ontology-based wrapper generation, XML-based information integration, view-based query answering, and QBE-style XML query interface.