Federated database systems for managing distributed, heterogeneous, and autonomous databases
ACM Computing Surveys (CSUR) - Special issue on heterogeneous databases
An improved third normal form for relational databases
ACM Transactions on Database Systems (TODS)
A relational model of data for large shared data banks
Communications of the ACM
Transforming Relational Database Schemas into Object-Oriented Schemas according to ODMG-93
DOOD '95 Proceedings of the Fourth International Conference on Deductive and Object-Oriented Databases
Metrics for Evaluating the Quality of Entity Relationship Models
ER '98 Proceedings of the 17th International Conference on Conceptual Modeling
Cost-Effective Semantic Annotation of XML Schemas and Web Service Interfaces
SCC '09 Proceedings of the 2009 IEEE International Conference on Services Computing
Web Intelligence and Agent Systems
Hi-index | 0.00 |
This paper addresses the problem of identifying redundant data in large-scale service-oriented information systems. Specifically, the paper puts forward an automated method to pinpoint potentially redundant data attributes from a given collection of semantically-annotated Web service interfaces. The key idea is to construct a service network to represent all input and output dependencies between data attributes and operations captured in the service interfaces, and to apply centrality measures from network theory in order to quantify the degree to which an attribute belongs to a given subsystem. The proposed method was tested on a federated governmental information system consisting of 58 independently-maintained information systems providing altogether about 1000 service operations described in WSDL. The accuracy of the method is evaluated in terms of precision and recall.