SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Ariadne: a system for constructing mediators for Internet sources
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Optimizing recall/precision scores in IR over the WWW
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
A hierarchical approach to wrapper induction
Proceedings of the third annual conference on Autonomous Agents
TrIAs: trainable information assistants for cooperative problem solving
Proceedings of the third annual conference on Autonomous Agents
Record-boundary discovery in Web documents
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
A Value-Driven System for Autonomous Information Gathering
Journal of Intelligent Information Systems
Querying websites using compact skeletons
PODS '01 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Automatic information extraction from web pages
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Hierarchical Wrapper Induction for Semistructured Information Sources
Autonomous Agents and Multi-Agent Systems
Supporting unified interface to wrapper generator in integrated information retrieval
Computer Standards & Interfaces - XML Diffusion: Transfer and differentiation
Acquiring and Structuring Web Content with Knowledge Level Models
EKAW '99 Proceedings of the 11th European Workshop on Knowledge Acquisition, Modeling and Management
Object-Oriented Mediator Queries to Internet Search Engines
OOIS '02 Proceedings of the Workshops on Advances in Object-Oriented Information Systems
Using Grammatical Inference to Automate Information Extraction from the Web
PKDD '01 Proceedings of the 5th European Conference on Principles of Data Mining and Knowledge Discovery
Towards Extensible Information Brokers Based on XML
CAiSE '00 Proceedings of the 12th International Conference on Advanced Information Systems Engineering
Solving Travel Problems by Integrating WEB Information with Planning
ISMIS '02 Proceedings of the 13th International Symposium on Foundations of Intelligent Systems
Mediation in a dynamic context: arguing for a request-oriented approach and structuring it
Web-enabled systems integration
Designing wrapper components for e-services in integrating heterogeneous systems
The VLDB Journal — The International Journal on Very Large Data Bases
Querying websites using compact skeletons
Journal of Computer and System Sciences - Special issu on PODS 2001
A Fully Automated Object Extraction System for the World Wide Web
ICDCS '01 Proceedings of the The 21st International Conference on Distributed Computing Systems
A semi-universal e-commerce agent: domain-dependant information gathering
Enterprise information systems IV
Programming by Demonstration Using Version Space Algebra
Machine Learning
Ontology extraction and conceptual modeling for web information
Information modeling for internet applications
Engineering high-performance legacy codes as CORBA components for problem-solving environments
Journal of Parallel and Distributed Computing
Artificial Intelligence for Engineering Design, Analysis and Manufacturing
SGrid: a service-oriented model for the Semantic grid
Future Generation Computer Systems - Special issue: Semantic grid and knowledge grid: the next-generation web
Tree-Structured Template Generation for Web Pages
WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
Leveraging legacy codes to distributed problem-solving environments: a web services approach
Software—Practice & Experience
IEEE Transactions on Knowledge and Data Engineering
Argument-based critics and recommenders: a qualitative perspective on user support systems
Data & Knowledge Engineering - Special issue: WIDM 2004
GRID '05 Proceedings of the 6th IEEE/ACM International Workshop on Grid Computing
Data Extraction From Repositories On The Web: A Semi-Automatic Approach
Journal of Integrated Design & Process Science
A methodical approach to extracting interesting objects from dynamic web pages
International Journal of Web and Grid Services
An information extraction approach to reorganizing and summarizing specifications
Information and Software Technology
An integrated system of mining HTML texts and filtering structured documents
PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
Flexible reuse of middleware infrastructures in heterogeneous IT environments
OTM'07 Proceedings of the 2007 OTM Confederated international conference on On the move to meaningful internet systems: CoopIS, DOA, ODBASE, GADA, and IS - Volume Part I
From information to knowledge: harvesting entities and relationships from web sources
Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
European research and development of intelligent information agents: the agentlink perspective
Intelligent information agents
Instance discovery and schema matching with applications to biological deep web data integration
DILS'10 Proceedings of the 7th international conference on Data integration in the life sciences
From the web of data to a world of action
Web Semantics: Science, Services and Agents on the World Wide Web
Improving web data annotations with spreading activation
WISE'05 Proceedings of the 6th international conference on Web Information Systems Engineering
Semantic partitioning of web pages
WISE'05 Proceedings of the 6th international conference on Web Information Systems Engineering
Learning layouts of biological datasets semi-automatically
DILS'05 Proceedings of the Second international conference on Data Integration in the Life Sciences
Structure detection system from web documents through backpropagation network learning
AI'06 Proceedings of the 19th Australian joint conference on Artificial Intelligence: advances in Artificial Intelligence
Cost effective ontology population with data from lists in OCRed historical documents
Proceedings of the 2nd International Workshop on Historical Document Imaging and Processing
Hi-index | 0.00 |
To simplify the task of obtaining information from the vast number of information sources that are available on the World Wide Web (WWW), the authors are building information mediators for extracting and integrating data from multiple Web sources. In a mediator based approach, wrappers are built around individual information sources to translate between the mediator query language and the individual sources. They present an approach for semi-automatically generating wrappers for structured Internet sources. The key idea is to exploit formatting information in Web pages to hypothesize the underlying structure of a page. From this structure the system generates a wrapper that facilitates querying of a source and possibly integrating it with other sources. They demonstrate the ease with which they are able to build wrappers for a number of Web sources using their implemented wrapper generation toolkit.