We consider the monitoring of a flow of incoming documents. More precisely, we present the monitoring used in a very large warehouse built from XML documents found on the web. The flow of documents consists of XML pages (which are warehoused) and HTML pages (which are not). Our contributions are the following:

- a subscription language that specifies the monitoring of pages when they are fetched, the periodic evaluation of continuous queries, and the production of XML reports;
- a description of the architecture of the system we implemented, which makes it possible to monitor a flow of millions of pages per day with millions of subscriptions on a single PC, and which scales up by using more machines;
- a new algorithm for processing alerts that can be used in a wider context.

We support monitoring at the page level (e.g., discovery of a new page within a certain semantic domain) as well as at the element level (e.g., insertion of a new electronic product in a catalog).

This work is part of the Xyleme system. Xyleme is developed on a cluster of PCs under Linux with CORBA communications. The part of the system described in this paper has been implemented. We report on first experiments.
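To make the distinction between page-level and element-level monitoring concrete, here is a minimal sketch in Python. It assumes, purely for illustration, that each subscription is a conjunction of atomic events and that fetching a page yields the set of atomic events detected on it. The event names, the Subscription class, and the matching_subscriptions function are hypothetical and do not come from the Xyleme system. The matching is a naive linear scan, not the alert-processing algorithm the paper contributes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Subscription:
    """A hypothetical subscription: fires when all its atomic events are detected."""
    name: str
    conditions: frozenset  # atomic event labels (illustrative strings)


def matching_subscriptions(detected_events, subscriptions):
    """Return the names of subscriptions whose conditions all appear in detected_events.

    Naive scan over every subscription; shown only to illustrate the idea.
    """
    detected = set(detected_events)
    return [s.name for s in subscriptions if s.conditions <= detected]


# Assumed example subscriptions: one at the page level, one at the element level.
subscriptions = [
    Subscription("new-page-in-domain",
                 frozenset({"page:new", "domain:electronics"})),
    Subscription("new-product-in-catalog",
                 frozenset({"element:inserted:product"})),
]

# Events detected when a (hypothetical) page is fetched.
events = {"page:new", "domain:electronics"}
print(matching_subscriptions(events, subscriptions))
```

A scan of this kind costs time proportional to the number of subscriptions for every fetched page, which is one reason a dedicated alert-processing algorithm matters at the scale of millions of subscriptions mentioned above.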