Indexing semistructured data using PATRICIA tree

Li Cheng Wu, Jorng Tzong Horng, Baw Jhiune Liu, Chin Yea Wang, Gwo Dong Chen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Information on the Web like HTML documents with images, video, and sound is a collection of heterogeneous data. HTML documents are semistructured in nature. Semistructured data are used to describe those structures which are less rigid or regular than those data found in standard database systems. This study presents a novel means of using Patricia Tree [14] to index semistructured data. This index is used by transferring the query into a regular expression and querying the regular expression over the Patricia Tree. The highlight of this approach is supporting query on content and structure simultaneously, while also supporting fast query time on long path and regular expressions.

Original languageEnglish
Title of host publicationDatabase and Expert Systems Applications - 11th International Conference, DEXA 2000, Proceedings
EditorsMohamed Ibrahim, Josef Kung, Norman Revell
PublisherSpringer Verlag
Pages859-868
Number of pages10
ISBN (Print)9783540679783
DOIs
StatePublished - 2000
Event11th International Conference on Database and Expert Systems Applications, DEXA 2000 - London, United Kingdom
Duration: 4 Sep 20008 Sep 2000

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume1873
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference11th International Conference on Database and Expert Systems Applications, DEXA 2000
Country/TerritoryUnited Kingdom
CityLondon
Period4/09/008/09/00

Fingerprint

Dive into the research topics of 'Indexing semistructured data using PATRICIA tree'. Together they form a unique fingerprint.

Cite this