Mining knowledge from databases: an information network analysis approach

Authors:
Jiawei Han;Yizhou Sun;Xifeng Yan;Philip S. Yu
Affiliations:
University of Illinois at Urbana-Champaign, Urbana, IL, USA;University of Illinois at Urbana-Champaign, Urbana, IL, USA;University of California at Santa Barbara, Santa Barbara, CA, USA;University of Illinois at Chicago, Chicago, IL, USA
Venue:
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Year:
2010

Citing 10
Cited 4

The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
SimRank: a measure of structural-context similarity

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Efficient Classification across Multiple Database Relations: A CrossMine Approach

IEEE Transactions on Knowledge and Data Engineering
SCAN: a structural clustering algorithm for networks

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
CrossClus: user-guided multi-relational clustering

Data Mining and Knowledge Discovery
Truth Discovery with Multiple Conflicting Information Providers on the Web

IEEE Transactions on Knowledge and Data Engineering
RankClus: integrating clustering with ranking for heterogeneous information network analysis

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Ranking-based clustering of heterogeneous information networks with star network schema

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Exploring social tagging graph for web object classification

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
The web as a graph: measurements, models, and methods

COCOON'99 Proceedings of the 5th annual international conference on Computing and combinatorics

Three challenges in data mining

Frontiers of Computer Science in China
WINACS: construction and analysis of web-based computer science information networks

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Finding information nebula over large networks

Proceedings of the 20th ACM international conference on Information and knowledge management
Ranking objects by following paths in entity-relationship graphs

Proceedings of the 4th workshop on Workshop for Ph.D. students in information & knowledge management

Quantified Score

Hi-index	0.00

Visualization

Abstract

Most people consider a database is merely a data repository that supports data storage and retrieval. Actually, a database contains rich, inter-related, multi-typed data and information, forming one or a set of gigantic, interconnected, heterogeneous information networks. Much knowledge can be derived from such information networks if we systematically develop an effective and scalable database-oriented information network analysis technology. In this tutorial, we introduce database-oriented information network analysis methods and demonstrate how information networks can be used to improve data quality and consistency, facilitate data integration, and generate interesting knowledge. This tutorial presents an organized picture on how to turn a database into one or a set of organized heterogeneous information networks, how information networks can be used for data cleaning, data consolidation, and data qualify improvement, how to discover various kinds of knowledge from information networks, how to perform OLAP in information networks, and how to transform database data into knowledge by information network analysis. Moreover, we present interesting case studies on real datasets, including DBLP and Flickr, and show how interesting and organized knowledge can be generated from database-oriented information networks.