Federated database systems for managing distributed, heterogeneous, and autonomous databases
ACM Computing Surveys (CSUR) - Special issue on heterogeneous databases
The data webhouse toolkit: building the web-enabled data warehouse
The data webhouse toolkit: building the web-enabled data warehouse
An event-condition-action language for XML
Proceedings of the 11th international conference on World Wide Web
Active Rules in Database Systems
Active Rules in Database Systems
The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling
The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling
Active data warehouses: complementing OLAP with analysis rules
Data & Knowledge Engineering - Data warehousing
Fast Algorithms for Mining Association Rules in Large Databases
VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Web services and data integration
WISE '02 Proceedings of the 3rd International Conference on Web Information Systems Engineering
Xyleme: A Dynamic Warehouse for XML Data of the Web
IDEAS '01 Proceedings of the International Database Engineering & Applications Symposium
XML Data Warehouse: Modelling and Querying
Proceedings of the Baltic Conference, BalticDB&IS 2002 - Volume 1
Web Data Management
Exchanging intensional XML data
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
XCube: XML for data warehouses
DOLAP '03 Proceedings of the 6th ACM international workshop on Data warehousing and OLAP
Service-Oriented Architecture: A Field Guide to Integrating XML and Web Services
Service-Oriented Architecture: A Field Guide to Integrating XML and Web Services
Proceedings of the 13th international conference on World Wide Web
Dynamic Data Integration Using Web Services
ICWS '04 Proceedings of the IEEE International Conference on Web Services
Data Mining: Concepts and Techniques
Data Mining: Concepts and Techniques
ETL queues for active data warehousing
Proceedings of the 2nd international workshop on Information quality in information systems
ACM SIGMOD Record
Conceptual Design of an XML FACT Repository for Dispersed XML Document Warehouses and XML Marts
CIT '05 Proceedings of the The Fifth International Conference on Computer and Information Technology
Research issues in data stream association rule mining
ACM SIGMOD Record
Efficient SIP-Specific Event Notification
ICNICONSMCL '06 Proceedings of the International Conference on Networking, International Conference on Systems and International Conference on Mobile Communications and Learning Technologies
Data integration: the teenage years
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Processing And Managing Complex Data for Decision Support
Processing And Managing Complex Data for Decision Support
XML structural delta mining: issues and challenges
Data & Knowledge Engineering - Special issue: ER 2003
Integrating deep web data sources
Integrating deep web data sources
XCraft: boosting the performance of active XML materialization
EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Supporting the ETL-process by Web Service technologies
International Journal of Web and Grid Services
Integrating Data Warehouses with Web Data: A Survey
IEEE Transactions on Knowledge and Data Engineering
The Active XML project: an overview
The VLDB Journal — The International Journal on Very Large Data Bases
Warehousing complex data from the web
International Journal of Web Engineering and Technology
Towards automatic generation of AXML web services for dynamic data integration
DataX '08 Proceedings of the 2008 EDBT workshop on Database technologies for handling XML information on the web
OptimAX: Optimizing Distributed ActiveXML Applications
ICWE '08 Proceedings of the 2008 Eighth International Conference on Web Engineering
An Event-Based Near Real-Time Data Integration Architecture
EDOCW '08 Proceedings of the 2008 12th Enterprise Distributed Object Computing Conference Workshops
ACM SIGMOD Record
CloudFuice: a flexible cloud-based data integration system
ICWE'11 Proceedings of the 11th international conference on Web engineering
X-HYBRIDJOIN for near-real-time data warehousing
BNCOD'11 Proceedings of the 28th British national conference on Advances in databases
Efficient incremental breadth-depth XML event mining
Proceedings of the 15th Symposium on International Database Engineering & Applications
XML-OLAP: a multidimensional analysis framework for XML warehouses
DaWaK'05 Proceedings of the 7th international conference on Data Warehousing and Knowledge Discovery
X-warehousing: an XML-based approach for warehousing complex data
ADBIS'06 Proceedings of the 10th East European conference on Advances in Databases and Information Systems
Information Systems Frontiers
On the improvement of active XML (AXML) representation and query evaluation
Information Systems Frontiers
Business Intelligence and the Web
Information Systems Frontiers
Hi-index | 0.00 |
Today, the Web is the largest source of information worldwide. There is currently a strong trend for decision-making applications such as Data Warehousing (DW) and Business Intelligence (BI) to move onto the Web, especially in the cloud. Integrating data into DW/BI applications is a critical and time-consuming task. To make better decisions in DW/BI applications, next generation data integration poses new requirements to data integration systems, over those posed by traditional data integration. In this paper, we propose a generic, metadata-based, service-oriented, and event-driven approach for integrating Web data timely and autonomously. Beside handling data heterogeneity, distribution and interoperability, our approach satisfies near real-time requirements and realize active data integration. For this sake, we design and develop a framework that utilizes Web standards (e.g., XML and Web services) for tackling data heterogeneity, distribution and interoperability issues. Moreover, our framework utilizes Active XML (AXML) to warehouse passive data as well as services to integrate active and dynamic data on-the-fly. AXML embedded services and changes detection services ensure near real-time data integration. Furthermore, the idea of integrating Web data actively and autonomously revolves around mining events logged by the data integration environment. Therefore, we propose an incremental XML-based algorithm for mining association rules from logged events. Then, we define active rules dynamically upon mined data to automate and reactivate integration tasks. Finally, as a proof of concept, we implement a framework prototype as a Web application using open-source tools.