Extracting structured data from Web pages
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Thresher: automating the unwrapping of semantic content from the World Wide Web
WWW '05 Proceedings of the 14th international conference on World Wide Web
Interactive wrapper generation with minimal user effort
Proceedings of the 15th international conference on World Wide Web
A Survey of Web Information Extraction Systems
IEEE Transactions on Knowledge and Data Engineering
Making mashups with marmite: towards end-user programming for the web
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Mining templates from search result records of search engines
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Damia: data mashups for intranet applications
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Mashroom: end-user mashup programming using nested tables
Proceedings of the 18th international conference on World wide web
Situational data integration with data services and nested table
Service Oriented Computing and Applications
Hi-index | 0.00 |
There exist numerous online data sources on the Web. It is desirable to facilitate end-users to build XML-based wrappers from the data sources for further composition and reuse. This paper describes Grubber, a tool that allows end-users to develop XML-based wrappers from these data sources with just a few mouse clicks and keystrokes. An active learning algorithm was proposed and implemented to reduce end-users' effort. Experimental results on real-world sites show that the algorithm can achieve a high degree of effectiveness. Compared with other similar tools, Grubber includes a number of usability improvements to lower the barrier of usage and we believe it is suitable for mass end-users to build situational applications.