ACM Transactions on Database Systems (TODS)
Management of heterogeneous and autonomous database systems
Management of heterogeneous and autonomous database systems
Learning Information Extraction Rules for Semi-Structured and Free Text
Machine Learning - Special issue on natural language learning
Efficient and extensible algorithms for multi query optimization
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Pipelining in multi-query optimization
PODS '01 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Building the Data Warehouse
Conceptual modeling for ETL processes
Proceedings of the 5th ACM international workshop on Data Warehousing and OLAP
Information Extraction with HMM Structures Learned by Stochastic Optimization
Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
Optimizing ETL Processes in Data Warehouses
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Clio grows up: from research prototype to industrial tool
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Data integration: the teenage years
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Designing ETL processes using semantic web technologies
DOLAP '06 Proceedings of the 9th ACM international workshop on Data warehousing and OLAP
MapReduce: simplified data processing on large clusters
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Deciding the physical implementation of ETL workflows
Proceedings of the ACM tenth international workshop on Data warehousing and OLAP
Survey of Text Mining II: Clustering, Classification, and Retrieval
Survey of Text Mining II: Clustering, Classification, and Retrieval
Advanced Data Warehouse Design: From Conventional to Spatial and Temporal Applications (Data-Centric Systems and Applications)
Tapping into unstructured data: integrating unstructured data and textual analytics into business intelligence
DW 2.0: The Architecture for the Next Generation of Data Warehousing
DW 2.0: The Architecture for the Next Generation of Data Warehousing
Automating the loading of business process data warehouses
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
RiTE: Providing On-Demand Data for Right-Time Data Warehousing
ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
ETL workflows: from formal specification to optimization
ADBIS'07 Proceedings of the 11th East European conference on Advances in databases and information systems
OptBPEL: a tool for performance optimization of BPEL process
SC'08 Proceedings of the 7th international conference on Software composition
Enabling outsourced service providers to think globally while acting locally
EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
DNIS'05 Proceedings of the 4th international conference on Databases in Networked Information Systems
Blueprints and measures for ETL workflows
ER'05 Proceedings of the 24th international conference on Conceptual Modeling
QoX-driven ETL design: reducing the cost of ETL consulting engagements
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Proceedings of the ACM twelfth international workshop on Data warehousing and OLAP
Virtual Business Operating Environment in the Cloud: Conceptual Architecture and Challenges
ER '09 Proceedings of the 28th International Conference on Conceptual Modeling
BP-Ex: a uniform query engine for business process execution traces
Proceedings of the 13th International Conference on Extending Database Technology
Leveraging web streams for contractual situational awareness in operational BI
Proceedings of the 2010 EDBT/ICDT Workshops
SIE-OBI: a streaming information extraction platform for operational business intelligence
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Leveraging business process models for ETL design
ER'10 Proceedings of the 29th international conference on Conceptual modeling
Designing integration flows using hypercubes
Proceedings of the 14th International Conference on Extending Database Technology
Live business intelligence for the real-time enterprise
From active data management to event-based systems and more
GEM: requirement-driven generation of ETL and multidimensional conceptual designs
DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
TTL: a transformation, transference and loading approach for active monitoring
DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
A platform for situational awareness in operational BI
Decision Support Systems
Information extraction, real-time processing and DW2.0 in operational business intelligence
DNIS'10 Proceedings of the 6th international conference on Databases in Networked Information Systems
On optimizing workflows using query processing techniques
SSDBM'12 Proceedings of the 24th international conference on Scientific and Statistical Database Management
Optimization of analytic data flows for next generation business intelligence applications
TPCTC'11 Proceedings of the Third TPC Technology conference on Topics in Performance Evaluation, Measurement and Characterization
Proceedings of the fifteenth international workshop on Data warehousing and OLAP
Schema decryption for large extract-transform-load systems
ER'12 Proceedings of the 31st international conference on Conceptual Modeling
Data management perspectives on business process management: tutorial overview
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Scheduling strategies for efficient ETL execution
Information Systems
A QoX model for ETL subsystems: theoretical and industry perspectives
Proceedings of the 14th International Conference on Computer Systems and Technologies
Lazy ETL in action: ETL technology dates scientific data
Proceedings of the VLDB Endowment
Hi-index | 0.00 |
Business Intelligence (BI) refers to technologies, tools, and practices for collecting, integrating, analyzing, and presenting large volumes of information to enable better decision making. Today's BI architecture typically consists of a data warehouse (or one or more data marts), which consolidates data from several operational databases, and serves a variety of front-end querying, reporting, and analytic tools. The back-end of the architecture is a data integration pipeline for populating the data warehouse by extracting data from distributed and usually heterogeneous operational sources; cleansing, integrating and transforming the data; and loading it into the data warehouse. Since BI systems have been used primarily for off-line, strategic decision making, the traditional data integration pipeline is a oneway, batch process, usually implemented by extract-transform-load (ETL) tools. The design and implementation of the ETL pipeline is largely a labor-intensive activity, and typically consumes a large fraction of the effort in data warehousing projects. Increasingly, as enterprises become more automated, data-driven, and real-time, the BI architecture is evolving to support operational decision making. This imposes additional requirements and tradeoffs, resulting in even more complexity in the design of data integration flows. These include reducing the latency so that near real-time data can be delivered to the data warehouse, extracting information from a wider variety of data sources, extending the rigidly serial ETL pipeline to more general data flows, and considering alternative physical implementations. We describe the requirements for data integration flows in this next generation of operational BI system, the limitations of current technologies, the research challenges in meeting these requirements, and a framework for addressing these challenges. The goal is to facilitate the design and implementation of optimal flows to meet business requirements.