Discovering interesting information with advances in web technology

Authors:
Richi Nayak;Pierre Senellart;Fabian M. Suchanek;Aparna S. Varde
Affiliations:
Queensland University of Technology, Brisbane, Australia;Institut Mines--Té/lé/com/ Té/lé/com ParisTech/ CNRS LTCI, Paris, France;Max Planck Institute for Informatics, Saarbrü/cken, Germany;Montclair State University, NJ, USA
Venue:
ACM SIGKDD Explorations Newsletter
Year:
2013

Citing 72
Cited 0

Mining association rules between sets of items in large databases

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
SSML: a speech synthesis markup language

Speech Communication
Focused crawling: a new approach to topic-specific Web resource discovery

WWW '99 Proceedings of the eighth international conference on World Wide Web
Data on the Web: from relations to semistructured data and XML

Data on the Web: from relations to semistructured data and XML
XTRACT: a system for extracting document type descriptors from XML documents

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Semantic extensions of XML for advanced applications

ITVE '01 Proceedings of the workshop on Information technology for virtual enterprises
Discovery of relational association rules

Relational Data Mining
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval
Discovery of frequent DATALOG patterns

Data Mining and Knowledge Discovery
Frequent Subgraph Discovery

ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Clustering Ontology-Based Metadata in the Semantic Web

PKDD '02 Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery
Crawling the Hidden Web

Proceedings of the 27th International Conference on Very Large Data Bases
RoadRunner: Towards Automatic Data Extraction from Large Web Sites

Proceedings of the 27th International Conference on Very Large Data Bases
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
Relational Association Rules: Getting WARMeR

Proceedings of the ESF Exploratory Workshop on Pattern Detection and Discovery
The Latest MML (Medical Markup Language) Version 2.3—XML-Based Standard for Medical Data Exchange/Storage

Journal of Medical Systems
Extracting structured data from Web pages

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Efficient Mining of Frequent Subgraphs in the Presence of Isomorphism

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
A Semantic Web Primer

A Semantic Web Primer
Probe, Cluster, and Discover: Focused Extraction of QA-Pagelets from the Deep Web

ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Schema and ontology matching with COMA++

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Efficiently Mining Frequent Trees in a Forest: Algorithms and Applications

IEEE Transactions on Knowledge and Data Engineering
Duplicate Record Detection: A Survey

IEEE Transactions on Knowledge and Data Engineering
Mining Generalized Associations of Semantic Relations from Textual Web Content

IEEE Transactions on Knowledge and Data Engineering
Collective entity resolution in relational data

ACM Transactions on Knowledge Discovery from Data (TKDD)
XML schema clustering with semantic and hierarchical similarity measures

Knowledge-Based Systems
Accessing the deep web

Communications of the ACM - ACM at sixty: a look back in time
Yago: a core of semantic knowledge

Proceedings of the 16th international conference on World Wide Web
Leveraging data and structure in ontology integration

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Xproj: a framework for projected structural clustering of xml documents

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Distributed search over the hidden web: hierarchical database sampling and selection

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
MatML: XML for information exchange with materials property data

Proceedings of the 4th international workshop on Data mining standards, services and platforms
Context-aware wrapping: synchronized data extraction

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Fast and effective clustering of XML data using structural information

Knowledge and Information Systems
XML fever

Communications of the ACM - Web science
Linked data on the web (LDOW2008)

Proceedings of the 17th international conference on World Wide Web
Report on the XML mining track at INEX 2007 categorization and clustering of XML documents

ACM SIGIR Forum
Clustering XML Documents Using Closed Frequent Subtrees: A Structural Similarity Approach

Focused Access to XML Documents
Data fusion

ACM Computing Surveys (CSUR)
Automatic wrapper induction from hidden-web sources with domain knowledge

Proceedings of the 10th ACM workshop on Web information and data management
Learning Concept Mappings from Instance Similarity

ISWC '08 Proceedings of the 7th International Conference on The Semantic Web
Large scale integration of senses for the semantic web

Proceedings of the 18th international conference on World wide web
Supporting the discovery and labeling of non-taxonomic relationships in ontology learning

Expert Systems with Applications: An International Journal
Combining a Logical and a Numerical Method for Data Reconciliation

Journal on Data Semantics XII
RiMOM: A Dynamic Multistrategy Ontology Alignment Framework

IEEE Transactions on Knowledge and Data Engineering
L2R: a logical method for reference reconciliation

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Ontology matching with semantic verification

Web Semantics: Science, Services and Agents on the World Wide Web
Discovering and Maintaining Links on the Web of Data

ISWC '09 Proceedings of the 8th International Semantic Web Conference
Reducing OWL entailment to description logic satisfiability

Web Semantics: Science, Services and Agents on the World Wide Web
DL-Learner: Learning Concepts in Description Logics

The Journal of Machine Learning Research
Sig.ma: live views on the web of data

Proceedings of the 19th international conference on World wide web
An empirical study of instance-based ontology matching

ISWC'07/ASWC'07 Proceedings of the 6th international The semantic web and 2nd Asian conference on Asian semantic web conference
DBpedia: a nucleus for a web of open data

ISWC'07/ASWC'07 Proceedings of the 6th international The semantic web and 2nd Asian conference on Asian semantic web conference
The role of semantics in mining frequent patterns from knowledge bases in description logics with rules

Theory and Practice of Logic Programming
Learning first-order Horn clauses from web text

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
SameAs networks and beyond: analyzing deployment status and implications of owl:sameAs in linked data

ISWC'10 Proceedings of the 9th international semantic web conference on The semantic web - Volume Part I
When owl: sameAs isn't the same: an analysis of identity in linked data

ISWC'10 Proceedings of the 9th international semantic web conference on The semantic web - Volume Part I
Mining association rules from semantic web data

IEA/AIE'10 Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part II
A self-training approach for resolving object coreference on the semantic web

Proceedings of the 20th international conference on World wide web
How matchable are four thousand ontologies on the semantic web

ESWC'11 Proceedings of the 8th extended semantic web conference on The semantic web: research and applications - Volume Part I
Inductive learning for the Semantic Web: What does it buy?

Semantic Web
Finding association rules in semantic web data

Knowledge-Based Systems
PARIS: probabilistic alignment of relations, instances, and schema

Proceedings of the VLDB Endowment
Leveraging terminological structure for object reconciliation

ESWC'10 Proceedings of the 7th international conference on The Semantic Web: research and Applications - Volume Part II
Bootstrapping domain ontology for semantic web services from source web sites

TES'05 Proceedings of the 6th international conference on Technologies for E-Services
Proceedings of the First international conference on Knowledge Discovery from XML Documents

KDXD'06 Proceedings of the First international conference on Knowledge Discovery from XML Documents
A General Framework for Mining Frequent Subgraphs from Labeled Graphs

Fundamenta Informaticae - Advances in Mining Graphs, Trees and Sequences
Frequent Subtree Mining - An Overview

Fundamenta Informaticae - Advances in Mining Graphs, Trees and Sequences
Acquiring temporal constraints between relations

Proceedings of the 21st ACM international conference on Information and knowledge management
LINDA: distributed web-of-data-scale entity matching

Proceedings of the 21st ACM international conference on Information and knowledge management
Quenchml: A semantics-preserving markup language for knowledge representation in quenching

Artificial Intelligence for Engineering Design, Analysis and Manufacturing
Combining structure and content similarities for XML document clustering

AusDM '08 Proceedings of the 7th Australasian Data Mining Conference - Volume 87

Quantified Score

Hi-index	0.00

Visualization

Abstract

The Web is a steadily evolving resource comprising much more than mere HTML pages. With its ever-growing data sources in a variety of formats, it provides great potential for knowledge discovery. In this article, we shed light on some interesting phenomena of the Web: the deep Web, which surfaces database records as Web pages; the Semantic Web, which defines meaningful data exchange formats; XML, which has established itself as a lingua franca for Web data exchange; and domain-specific markup languages, which are designed based on XML syntax with the goal of preserving semantics in targeted domains. We detail these four developments in Web technology, and explain how they can be used for data mining. Our goal is to show that all these areas can be as useful for knowledge discovery as the HTML-based part of the Web.