Automatic wrapper generation using tree matching and partial tree alignment
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Enhancing document structure analysis using visual analytics
Proceedings of the 2010 ACM Symposium on Applied Computing
Flexible reuse of middleware infrastructures in heterogeneous IT environments
OTM'07 Proceedings of the 2007 OTM Confederated international conference on On the move to meaningful internet systems: CoopIS, DOA, ODBASE, GADA, and IS - Volume Part I
The OXPath to success in the deep web
Proceedings of the 20th international conference companion on World wide web
Integrating semi-structured data into business applications: a web intelligence example
WM'05 Proceedings of the Third Biennial conference on Professional Knowledge Management
Information extraction for the semantic web
Proceedings of the First international conference on Reasoning Web
NGITS'06 Proceedings of the 6th international conference on Next Generation Information Technologies and Systems
Chapter 6: web data extraction for service creation
Search Computing
TEX: An efficient and effective unsupervised Web information extractor
Knowledge-Based Systems
OXPath: A language for scalable data extraction, automation, and crawling on the deep web
The VLDB Journal — The International Journal on Very Large Data Bases
Towards Comparative Mining of Web Document Objects with NFA: WebOMiner System
International Journal of Data Warehousing and Mining
Accelerating Structured Web Crawling without Losing Data
Proceedings of International Conference on Information Integration and Web-based Applications & Services
Hi-index | 0.00 |
Semi-automatic wrapper generation tools aim to ease the task of building structured views over web sources. But the wrapper generation techniques presented up to date show several weaknesses when dealing with the complex commercial web sources of today, specially when constructing advanced navigational sequences for accessing data. We present Wargo, a semi-automatic wrapper generation tool, which has been used by non-programmer staff to successfully wrap more than 700 commercial web sources in several industrial applications.