A deferred cleansing method for RFID data analytics

Authors:
Jun Rao;Sangeeta Doraiswamy;Hetal Thakkar;Latha S. Colby
Affiliations:
IBM Almaden Research Center;IBM Almaden Research Center;UCLA;IBM Almaden Research Center
Venue:
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Year:
2006

Citing 15
Cited 35

Adapting materialized views after redefinitions

SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
Optimization techniques for queries with expensive methods

ACM Transactions on Database Systems (TODS)
Consistent query answers in inconsistent databases

PODS '99 Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Optimization of sequence queries in database systems

PODS '01 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
On the decidability and complexity of query answering over inconsistent and incomplete databases

Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Robust and efficient fuzzy match for online data cleaning

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
ConQuer: efficient management of inconsistent databases

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Data cleaning in microsoft SQL server 2005

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Temporal management of RFID data

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Supporting RFID-based item tracking applications in Oracle DBMS using a bitmap datatype

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Warehousing and Analyzing Massive RFID Data Sets

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
Model-driven data acquisition in sensor networks

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Integrating automatic data acquisition with business processes experiences with SAP's auto-ID infrastructure

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Managing RFID data

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Declarative support for sensor data cleaning

PERVASIVE'06 Proceedings of the 4th international conference on Pervasive Computing

On impact-oriented automatic resolution of pervasive context inconsistency

Proceedings of the the 6th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering
On impact-oriented automatic resolution of pervasive context inconsistency

The 6th Joint Meeting on European software engineering conference and the ACM SIGSOFT symposium on the foundations of software engineering: companion papers
Managing RFID data in supply chains

International Journal of Internet Protocol Technology
Testing pervasive software in the presence of context inconsistency resolution services

Proceedings of the 30th international conference on Software engineering
Efficient storage scheme and query processing for supply chain management using RFID

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Cascadia: A System for Specifying, Detecting, and Managing RFID Events

Proceedings of the 6th international conference on Mobile systems, applications, and services
Reducing false reads in RFID-embedded supply chains

Journal of Theoretical and Applied Electronic Commerce Research
Anomaly-free incremental output in stream processing

Proceedings of the 17th ACM conference on Information and knowledge management
Identifying RFID-embedded objects in pervasive healthcare applications

Decision Support Systems
Efficient RFID Data Imputation by Analyzing the Correlations of Monitored Objects

DASFAA '09 Proceedings of the 14th International Conference on Database Systems for Advanced Applications
RFID Data Aggregation

GSN '09 Proceedings of the 3rd International Conference on GeoSensor Networks
Complex RFID event processing

The VLDB Journal — The International Journal on Very Large Data Bases
Partial constraint checking for context consistency in pervasive computing

ACM Transactions on Software Engineering and Methodology (TOSEM)
Finding misplaced items in retail by clustering RFID data

Proceedings of the 13th International Conference on Extending Database Technology
Real-time event handling in an RFID middleware system

DNIS'07 Proceedings of the 5th international conference on Databases in networked information systems
Fast track article: A temporal RFID data model for querying physical objects

Pervasive and Mobile Computing
Leveraging spatio-temporal redundancy for RFID data cleansing

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Applying a neural network to recover missed RFID readings

ACSC '10 Proceedings of the Thirty-Third Australasian Conferenc on Computer Science - Volume 102
Correcting missing data anomalies with clausal defeasible logic

ADBIS'10 Proceedings of the 14th east European conference on Advances in databases and information systems
Ubiquitous RFID: Where are we?

Information Systems Frontiers
Distributed inference and query processing for RFID tracking and monitoring

Proceedings of the VLDB Endowment
Complex event processing over unreliable RFID data streams

APWeb'11 Proceedings of the 13th Asia-Pacific web conference on Web technologies and applications
Leveraging communication information among readers for RFID data cleaning

WAIM'11 Proceedings of the 12th international conference on Web-age information management
Managing RFID events in large-scale distributed RFID infrastructures

Information Technology and Management
A novel integrated classifier for handling data warehouse anomalies

ADBIS'11 Proceedings of the 15th international conference on Advances in databases and information systems
Developing RFID database models for analysing moving tags in supply chain management

ER'11 Proceedings of the 30th international conference on Conceptual modeling
X-CleLo: intelligent deterministic RFID data and event transformer

Personal and Ubiquitous Computing
Belief based data cleaning for wireless sensor networks

Wireless Communications & Mobile Computing
An approximate duplicate elimination in RFID data streams

Data & Knowledge Engineering
Adam: Identifying defects in context-aware adaptation

Journal of Systems and Software
A model-based approach for RFID data stream cleansing

Proceedings of the 21st ACM international conference on Information and knowledge management
Asynchronous event detection for context inconsistency in pervasive computing

International Journal of Ad Hoc and Ubiquitous Computing
Evaluating the performance of a discrete manufacturing process using RFID: A case study

Robotics and Computer-Integrated Manufacturing
Challenges in developing software for cyber-physical systems

Proceedings of the 5th Asia-Pacific Symposium on Internetware
An intelligent approach to handle False-Positive Radio Frequency Identification Anomalies

Intelligent Data Analysis

Quantified Score

Hi-index	0.02

Visualization

Abstract

Radio Frequency Identification is gaining broader adoption in many areas. One of the challenges in implementing an RFID-based system is dealing with anomalies in RFID reads. A small number of anomalies can translate into large errors in analytical results. Conventional "eager" approaches cleanse all data upfront and then apply queries on cleaned data. However, this approach is not feasible when several applications define anomalies and corrections on the same data set differently and not all anomalies can be defined beforehand. This necessitates anomaly handling at query time. We introduce a deferred approach for detecting and correcting RFID data anomalies. Each application specifies the detection and the correction of relevant anomalies using declarative sequence-based rules. An application query is then automatically rewritten based on the cleansing rules that the application has specified, to provide answers over cleaned data. We show that a naive approach to deferred cleansing that applies rules without leveraging query information can be prohibitive. We develop two novel rewrite methods, both of which reduce the amount of data to be cleaned, by exploiting predicates in application queries while guaranteeing correct answers. We leverage standardized SQL/OLAP functionality to implement rules specified in a declarative sequence-based language. This allows efficient evaluation of cleansing rules using existing query processing capabilities of a DBMS. Our experimental results show that deferred cleansing is affordable for typical analytic queries over RFID data.