A comparative analysis of methodologies for database schema integration
ACM Computing Surveys (CSUR)
Template-based wrappers in the TSIMMIS system
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Lore: a database management system for semistructured data
ACM SIGMOD Record
A Query Translation Scheme for Rapid Implementation of Wrappers
DOOD '95 Proceedings of the Fourth International Conference on Deductive and Object-Oriented Databases
Object Exchange Across Heterogeneous Information Sources
ICDE '95 Proceedings of the Eleventh International Conference on Data Engineering
DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases
VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
The Rufus System: Information Organization for Semi-Structured Data
VLDB '93 Proceedings of the 19th International Conference on Very Large Data Bases
Object Fusion in Mediator Systems
VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
Towards heterogeneous multimedia information systems: the Garlic approach
RIDE '95 Proceedings of the 5th International Workshop on Research Issues in Data Engineering-Distributed Object Management (RIDE-DOM'95)
A survey in indexing and searching XML documents
Journal of the American Society for Information Science and Technology - XML
A brief survey of web data extraction tools
ACM SIGMOD Record
DEByE - Date extraction by example
Data & Knowledge Engineering
Managing Scientific Metadata Using XML
IEEE Internet Computing
A Survey of Web Information Extraction Systems
IEEE Transactions on Knowledge and Data Engineering
Web wrapper induction: a brief survey
AI Communications
Data Extraction From Repositories On The Web: A Semi-Automatic Approach
Journal of Integrated Design & Process Science
Context-aware wrapping: synchronized data extraction
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Extracting lists of data records from semi-structured web pages
Data & Knowledge Engineering
ACM Computing Surveys (CSUR)
Information extraction for search engines using fast heuristic techniques
Data & Knowledge Engineering
Visual extraction of information from web pages
Journal of Visual Languages and Computing
Finding and Extracting Data Records from Web Pages
Journal of Signal Processing Systems
A method for web information extraction
APWeb'08 Proceedings of the 10th Asia-Pacific web conference on Progress in WWW research and development
Web news extraction based on path pattern mining
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 7
A simhash-based scheme for locating product information from the web
Proceedings of the Second Symposium on Information and Communication Technology
The SEWASIE multi-agent system
AP2PC'04 Proceedings of the Third international conference on Agents and Peer-to-Peer Computing
Chapter 6: web data extraction for service creation
Search Computing
ICCSA'10 Proceedings of the 2010 international conference on Computational Science and Its Applications - Volume Part II
The HiLeX system for semantic information extraction
Transactions on Large-Scale Data- and Knowledge-Centered Systems V
Data extraction from web pages based on structural-semantic entropy
Proceedings of the 21st international conference companion on World Wide Web
AMBER: turning annotations into knowledge
Proceedings of the 21st international conference companion on World Wide Web
TEX: An efficient and effective unsupervised Web information extractor
Knowledge-Based Systems
A general theory of spatial relations to support a graphical tool for visual information extraction
Journal of Visual Languages and Computing
Hi-index | 0.00 |
In this paper we discuss themanagement of semi-structured data, i.e., data that has irregular or dynamically changing structure. We describe components of the Stanford TSIMMIS Project that help extract semi-structured data from Web pages, that allow the storage and querying of semi-structured data, and that allow its browsing through the World Wide Web. A prototype implementation of the TSIMMIS system as described here is currently installed and running in the database group testbed.