Efficient searches for similar subsequences of different lengths in sequence databases

Sanghyun Park, Wesley W. Chu, Jeehee Yoon, Chihcheng Hsu

研究成果: 會議貢獻類型會議論文同行評審

124 引文 斯高帕斯(Scopus)

摘要

We propose an indexing technique for fast retrieval of similar subsequences using time warping distances. A time warping distance is a more suitable similarity measure than the Euclidean distance in many applications, where sequences may be of different lengths or different sampling rates. Our indexing technique uses a disk-based suffix tree as an index structure and employs lower-bound distance functions to filter out dissimilar subsequences without false dismissals. To make the index structure compact and thus accelerate the query processing, we convert sequences of continuous values to sequences of discrete values via a categorization method and store only a subset of suffixes whose first values are different from their preceding values. The experimental results reveal that our proposed technique can be a few orders of magnitude faster than sequential scanning.

原文???core.languages.en_GB???
頁面23-32
頁數10
出版狀態已出版 - 2000
事件2000 IEEE 16th International Conference on Data Engineering (ICDE'00) - San Diego, CA, USA
持續時間: 29 2月 20003 3月 2000

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???2000 IEEE 16th International Conference on Data Engineering (ICDE'00)
城市San Diego, CA, USA
期間29/02/003/03/00

指紋

深入研究「Efficient searches for similar subsequences of different lengths in sequence databases」主題。共同形成了獨特的指紋。

引用此