We propose a highly compact two-part representation of a given graph G, consisting of a graph summary and a set of corrections. The graph summary is an aggregate graph in which each node corresponds to a set of nodes in G, and each edge represents the edges between all pairs of nodes in the two sets. The corrections portion specifies the list of edge corrections that must be applied to the summary to recreate G. Our representations allow for both lossless and lossy graph compression, with bounds on the introduced error. Further, in combination with the MDL principle, they yield highly intuitive coarse-level summaries of the input graph G. We develop algorithms to construct compact graph representations with guaranteed accuracy, and validate our approach through an extensive set of experiments with multiple real-life graph data sets. To the best of our knowledge, this is the first work to compute graph summaries using the MDL principle and to use the summaries (along with corrections) to compress graphs with bounded error.
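
To make the two-part representation concrete, the following minimal Python sketch rebuilds an undirected graph from a summary plus corrections. It is an illustration under stated assumptions, not the paper's algorithm: the names `reconstruct`, `representation_cost`, `supernodes`, `superedges`, `additions`, and `deletions` are hypothetical, the toy graph is invented for the example, and the cost function (superedges plus corrections) is only an assumed stand-in for an MDL-style size measure.

```python
from itertools import combinations, product


def edge(u, v):
    """Return a canonical (sorted) form of an undirected edge."""
    return (u, v) if u <= v else (v, u)


def reconstruct(supernodes, superedges, additions, deletions):
    """Rebuild the edge set of G from a summary and its corrections.

    supernodes: dict mapping a supernode id to the set of original nodes it covers
    superedges: iterable of (a, b) supernode pairs; (a, a) means all pairs inside a
    additions:  edges present in G but not implied by the summary
    deletions:  edges implied by the summary but absent from G
    """
    edges = set()
    for a, b in superedges:
        if a == b:
            pairs = combinations(sorted(supernodes[a]), 2)
        else:
            pairs = product(supernodes[a], supernodes[b])
        # A superedge stands for every edge between the two node sets.
        edges.update(edge(u, v) for u, v in pairs)
    # Apply the corrections: drop over-approximated edges, add missing ones.
    edges -= {edge(u, v) for u, v in deletions}
    edges |= {edge(u, v) for u, v in additions}
    return edges


def representation_cost(superedges, additions, deletions):
    """Assumed toy size measure: one unit per superedge and per correction."""
    return len(superedges) + len(additions) + len(deletions)


# Hypothetical toy graph with six nodes and six edges.
original_edges = {(1, 3), (1, 4), (1, 5), (2, 3), (2, 4), (1, 6)}

supernodes = {"X": {1, 2}, "Y": {3, 4, 5}, "Z": {6}}
superedges = [("X", "Y")]   # implies all six pairs between X and Y
deletions = [(2, 5)]        # implied by the superedge, but not in G
additions = [(1, 6)]        # in G, but not covered by any superedge

rebuilt = reconstruct(supernodes, superedges, additions, deletions)
cost = representation_cost(superedges, additions, deletions)
```

In this toy case the rebuilt edge set equals `original_edges`, so the representation is lossless, and it needs one superedge plus two corrections instead of six explicit edges. Omitting some corrections would give a lossy variant, with the skipped corrections being exactly the introduced error.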