Many semistructured objects are similarly, though not identically, structured. We study the problem of discovering "typical" substructures of a collection of semistructured objects. The discovered structures can serve the following purposes: 1) a "table of contents" for gaining general information about a source, 2) a road map for browsing and querying information sources, 3) a basis for clustering documents, 4) partial schemas for providing standard database access methods, and 5) users' or customers' interests and browsing patterns. The structural features of semistructured data affect the discovery task in nontrivial ways, and traditional data mining frameworks are inapplicable. We define this discovery problem and propose a solution.