Applying genetic algorithms to query optimization in document retrieval

Jorng Tzong Horng, Ching Chang Yeh

Research output: Contribution to journal, Article, peer-reviewed

105 citations (Scopus)

Abstract

This paper proposes a novel approach that automatically retrieves keywords and then uses genetic algorithms to adapt the keyword weights. One contribution of the paper is the combination of the bigram model and the PAT-tree structure for keyword retrieval: the approach extracts bigrams from documents and uses them to construct a PAT-tree from which keywords are retrieved. The proposed approach can retrieve any type of keyword, such as technical keywords and personal names. Its effectiveness is demonstrated by comparing the keywords it finds with those found by the PAT-tree based approach. This comparison reveals that our keyword retrieval approach is as accurate as the PAT-tree based approach, yet it is faster and uses less memory. The study then applies genetic algorithms to tune the weights of the retrieved keywords. Finally, the approach is tested on several documents obtained from web sites, and the experimental results are compared with those of other approaches, indicating that the proposed approach is highly promising for applications.
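
The two stages named in the abstract, bigram extraction and genetic tuning of keyword weights, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the character-level definition of a bigram, the pairwise-ranking fitness function, the tournament selection, one-point crossover, uniform mutation and elitism, and the names `char_bigrams`, `fitness` and `evolve`. The PAT-tree construction step is omitted entirely.

```python
import random
from collections import Counter


def char_bigrams(text):
    """Count adjacent character pairs, skipping pairs that contain whitespace.

    Assumption: bigrams are taken at the character level.
    """
    return Counter(a + b for a, b in zip(text, text[1:])
                   if not (a.isspace() or b.isspace()))


def fitness(weights, docs, relevant):
    """Count (relevant, non-relevant) document pairs that are ranked correctly.

    Each document is a tuple of 0/1 flags, one per keyword. A document's score
    is the sum of the weights of the keywords it contains. This fitness is a
    hypothetical stand-in, not the paper's measure.
    """
    scores = [sum(w for w, present in zip(weights, doc) if present)
              for doc in docs]
    return sum(1 for i in relevant for j in range(len(docs))
               if j not in relevant and scores[i] > scores[j])


def evolve(docs, relevant, n_keywords, pop_size=20, generations=30,
           mutation_rate=0.1, seed=0):
    """Evolve a weight vector in [0, 1) per keyword; needs n_keywords >= 2."""
    rng = random.Random(seed)
    pop = [[rng.random() for _ in range(n_keywords)] for _ in range(pop_size)]

    def score(individual):
        return fitness(individual, docs, relevant)

    for _ in range(generations):
        # Elitism: carry the best individual over unchanged.
        next_pop = [max(pop, key=score)[:]]
        while len(next_pop) < pop_size:
            # Tournament selection of size 2 for each parent.
            p1 = max(rng.sample(pop, 2), key=score)
            p2 = max(rng.sample(pop, 2), key=score)
            # One-point crossover.
            cut = rng.randrange(1, n_keywords)
            child = p1[:cut] + p2[cut:]
            # Uniform mutation: occasionally redraw a weight.
            child = [rng.random() if rng.random() < mutation_rate else g
                     for g in child]
            next_pop.append(child)
        pop = next_pop
    return max(pop, key=score)


if __name__ == "__main__":
    # Toy data: four documents over four keywords; documents 0 and 1 are relevant.
    docs = [(1, 1, 0, 0), (1, 0, 1, 0), (0, 0, 1, 1), (0, 1, 0, 1)]
    best = evolve(docs, relevant={0, 1}, n_keywords=4)
    print(best, fitness(best, docs, {0, 1}))
```

In this sketch the user's relevance judgments drive the fitness, so weights of keywords that separate relevant from non-relevant documents tend to rise over the generations. Any other retrieval measure could be substituted for the pairwise count.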

Original language: English
Pages (from-to): 737-759
Number of pages: 23
Journal: Information Processing and Management
Volume: 36
Issue number: 5
Publication status: Published - 1 Sep 2000
