Mining association rules between sets of items in large databases
SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
SSML: a speech synthesis markup language
Speech Communication
Focused crawling: a new approach to topic-specific Web resource discovery
WWW '99 Proceedings of the eighth international conference on World Wide Web
Data on the Web: from relations to semistructured data and XML
Data on the Web: from relations to semistructured data and XML
XTRACT: a system for extracting document type descriptors from XML documents
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Semantic extensions of XML for advanced applications
ITVE '01 Proceedings of the workshop on Information technology for virtual enterprises
Discovery of relational association rules
Relational Data Mining
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Discovery of frequent DATALOG patterns
Data Mining and Knowledge Discovery
ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Clustering Ontology-Based Metadata in the Semantic Web
PKDD '02 Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery
Proceedings of the 27th International Conference on Very Large Data Bases
RoadRunner: Towards Automatic Data Extraction from Large Web Sites
Proceedings of the 27th International Conference on Very Large Data Bases
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment
Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
Relational Association Rules: Getting WARMeR
Proceedings of the ESF Exploratory Workshop on Pattern Detection and Discovery
Journal of Medical Systems
Extracting structured data from Web pages
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Efficient Mining of Frequent Subgraphs in the Presence of Isomorphism
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
A Semantic Web Primer
Probe, Cluster, and Discover: Focused Extraction of QA-Pagelets from the Deep Web
ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Schema and ontology matching with COMA++
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Efficiently Mining Frequent Trees in a Forest: Algorithms and Applications
IEEE Transactions on Knowledge and Data Engineering
Duplicate Record Detection: A Survey
IEEE Transactions on Knowledge and Data Engineering
Mining Generalized Associations of Semantic Relations from Textual Web Content
IEEE Transactions on Knowledge and Data Engineering
Collective entity resolution in relational data
ACM Transactions on Knowledge Discovery from Data (TKDD)
XML schema clustering with semantic and hierarchical similarity measures
Knowledge-Based Systems
Communications of the ACM - ACM at sixty: a look back in time
Yago: a core of semantic knowledge
Proceedings of the 16th international conference on World Wide Web
Leveraging data and structure in ontology integration
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Xproj: a framework for projected structural clustering of xml documents
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Distributed search over the hidden web: hierarchical database sampling and selection
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
MatML: XML for information exchange with materials property data
Proceedings of the 4th international workshop on Data mining standards, services and platforms
Context-aware wrapping: synchronized data extraction
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Fast and effective clustering of XML data using structural information
Knowledge and Information Systems
Communications of the ACM - Web science
Linked data on the web (LDOW2008)
Proceedings of the 17th international conference on World Wide Web
Clustering XML Documents Using Closed Frequent Subtrees: A Structural Similarity Approach
Focused Access to XML Documents
ACM Computing Surveys (CSUR)
Automatic wrapper induction from hidden-web sources with domain knowledge
Proceedings of the 10th ACM workshop on Web information and data management
Learning Concept Mappings from Instance Similarity
ISWC '08 Proceedings of the 7th International Conference on The Semantic Web
Large scale integration of senses for the semantic web
Proceedings of the 18th international conference on World wide web
Supporting the discovery and labeling of non-taxonomic relationships in ontology learning
Expert Systems with Applications: An International Journal
Combining a Logical and a Numerical Method for Data Reconciliation
Journal on Data Semantics XII
RiMOM: A Dynamic Multistrategy Ontology Alignment Framework
IEEE Transactions on Knowledge and Data Engineering
L2R: a logical method for reference reconciliation
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Ontology matching with semantic verification
Web Semantics: Science, Services and Agents on the World Wide Web
Discovering and Maintaining Links on the Web of Data
ISWC '09 Proceedings of the 8th International Semantic Web Conference
Reducing OWL entailment to description logic satisfiability
Web Semantics: Science, Services and Agents on the World Wide Web
DL-Learner: Learning Concepts in Description Logics
The Journal of Machine Learning Research
Sig.ma: live views on the web of data
Proceedings of the 19th international conference on World wide web
An empirical study of instance-based ontology matching
ISWC'07/ASWC'07 Proceedings of the 6th international The semantic web and 2nd Asian conference on Asian semantic web conference
DBpedia: a nucleus for a web of open data
ISWC'07/ASWC'07 Proceedings of the 6th international The semantic web and 2nd Asian conference on Asian semantic web conference
Theory and Practice of Logic Programming
Learning first-order Horn clauses from web text
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
ISWC'10 Proceedings of the 9th international semantic web conference on The semantic web - Volume Part I
When owl: sameAs isn't the same: an analysis of identity in linked data
ISWC'10 Proceedings of the 9th international semantic web conference on The semantic web - Volume Part I
Mining association rules from semantic web data
IEA/AIE'10 Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part II
A self-training approach for resolving object coreference on the semantic web
Proceedings of the 20th international conference on World wide web
How matchable are four thousand ontologies on the semantic web
ESWC'11 Proceedings of the 8th extended semantic web conference on The semantic web: research and applications - Volume Part I
Finding association rules in semantic web data
Knowledge-Based Systems
PARIS: probabilistic alignment of relations, instances, and schema
Proceedings of the VLDB Endowment
Leveraging terminological structure for object reconciliation
ESWC'10 Proceedings of the 7th international conference on The Semantic Web: research and Applications - Volume Part II
Bootstrapping domain ontology for semantic web services from source web sites
TES'05 Proceedings of the 6th international conference on Technologies for E-Services
Proceedings of the First international conference on Knowledge Discovery from XML Documents
KDXD'06 Proceedings of the First international conference on Knowledge Discovery from XML Documents
A General Framework for Mining Frequent Subgraphs from Labeled Graphs
Fundamenta Informaticae - Advances in Mining Graphs, Trees and Sequences
Frequent Subtree Mining - An Overview
Fundamenta Informaticae - Advances in Mining Graphs, Trees and Sequences
Acquiring temporal constraints between relations
Proceedings of the 21st ACM international conference on Information and knowledge management
LINDA: distributed web-of-data-scale entity matching
Proceedings of the 21st ACM international conference on Information and knowledge management
Quenchml: A semantics-preserving markup language for knowledge representation in quenching
Artificial Intelligence for Engineering Design, Analysis and Manufacturing
Combining structure and content similarities for XML document clustering
AusDM '08 Proceedings of the 7th Australasian Data Mining Conference - Volume 87
Hi-index | 0.00 |
The Web is a steadily evolving resource comprising much more than mere HTML pages. With its ever-growing data sources in a variety of formats, it provides great potential for knowledge discovery. In this article, we shed light on some interesting phenomena of the Web: the deep Web, which surfaces database records as Web pages; the Semantic Web, which defines meaningful data exchange formats; XML, which has established itself as a lingua franca for Web data exchange; and domain-specific markup languages, which are designed based on XML syntax with the goal of preserving semantics in targeted domains. We detail these four developments in Web technology, and explain how they can be used for data mining. Our goal is to show that all these areas can be as useful for knowledge discovery as the HTML-based part of the Web.