TY - JOUR
T1 - A bayesian inference-based framework for RFID data cleansing
AU - Ku, Wei Shinn
AU - Chen, Haiquan
AU - Wang, Haixun
AU - Sun, Min Te
PY - 2013
Y1 - 2013
N2 - The past few years have witnessed the emergence of an increasing number of applications for tracking and tracing based on radio frequency identification (RFID) technologies. However, raw RFID readings are usually of low quality and may contain numerous anomalies. An ideal solution for RFID data cleansing should address the following issues. First, in many applications, duplicate readings of the same object are very common. The solution should take advantage of the resulting data redundancy for data cleaning. Second, prior knowledge about the environment may help improve data quality, and a desired solution must be able to take into account such knowledge. Third, the solution should take advantage of physical constraints in target applications to elevate the accuracy of data cleansing. There are several existing RFID data cleansing techniques. However, none of them support all the aforementioned features. In this paper, we propose a Bayesian inference-based framework for cleaning RFID raw data. We first design an (n)-state detection model and formally prove that the three-state model can maximize the system performance. Then, we extend the (n)-state model to support two-dimensional RFID reader arrays and compute the likelihood efficiently. In addition, we devise a Metropolis-Hastings sampler with constraints, which incorporates constraint management to clean RFID data with high efficiency and accuracy. Moreover, to support real-time object monitoring, we present the streaming Bayesian inference method to cope with real-time RFID data streams. Finally, we evaluate the performance of our solutions through extensive experiments.
AB - The past few years have witnessed the emergence of an increasing number of applications for tracking and tracing based on radio frequency identification (RFID) technologies. However, raw RFID readings are usually of low quality and may contain numerous anomalies. An ideal solution for RFID data cleansing should address the following issues. First, in many applications, duplicate readings of the same object are very common. The solution should take advantage of the resulting data redundancy for data cleaning. Second, prior knowledge about the environment may help improve data quality, and a desired solution must be able to take into account such knowledge. Third, the solution should take advantage of physical constraints in target applications to elevate the accuracy of data cleansing. There are several existing RFID data cleansing techniques. However, none of them support all the aforementioned features. In this paper, we propose a Bayesian inference-based framework for cleaning RFID raw data. We first design an (n)-state detection model and formally prove that the three-state model can maximize the system performance. Then, we extend the (n)-state model to support two-dimensional RFID reader arrays and compute the likelihood efficiently. In addition, we devise a Metropolis-Hastings sampler with constraints, which incorporates constraint management to clean RFID data with high efficiency and accuracy. Moreover, to support real-time object monitoring, we present the streaming Bayesian inference method to cope with real-time RFID data streams. Finally, we evaluate the performance of our solutions through extensive experiments.
KW - Data cleaning
KW - probabilistic algorithms
KW - spatiotemporal databases
KW - uncertainty
UR - http://www.scopus.com/inward/record.url?scp=84883292723&partnerID=8YFLogxK
U2 - 10.1109/TKDE.2012.116
DO - 10.1109/TKDE.2012.116
M3 - 期刊論文
AN - SCOPUS:84883292723
SN - 1041-4347
VL - 25
SP - 2177
EP - 2191
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
IS - 10
M1 - 6216377
ER -