In a data mining project developed on a relational database, significant effort is required to build a data set for analysis. The main reason is that the database generally consists of normalized tables that must be joined, aggregated, and transformed to build the required data set. Such a scenario results in many complex SQL queries that are written independently of each other, in a disorganized manner. As a result, the database accumulates many tables and views that are not represented as entities in the ER model, and similar SQL queries are written multiple times, creating problems for database evolution and software maintenance. In this paper, we classify potential database transformations, extend an ER diagram with entities that capture database transformations, and introduce an algorithm that automates the creation of such an extended ER model. We present a case study with a public database, illustrating the database transformations needed to build a data set for computing a typical data mining model.