Enhance genomic IR with term variation and expansion: Experiences of the IASL group at genomic track 2005

Tzong Han Tsai, Chia Wei Wu, Hsieh Chuan Hung, Yu Chun Wang, Ding He, Yi Feng Lin, Cheng Wei Lee, Ting Yi Sung, Wen Lian Hsu

研究成果: 雜誌貢獻會議論文同行評審

摘要

The rapid increase of biomedical literature available on the web has made it increasingly difficult to find precise information. To implement an accurate biomedical information retrieval (IR) system, we must deal with the variants of biomedical terms carefully. In this paper, we focus on the generation of aliases, synonyms, acronyms, and lexical variants of such terms. In addition, we also propose a hyphen handling technique for processing hyphenated terms. We use the original terms/phrases, and expanded terms/phrases to construct an Indri query, and evaluate the effectiveness of various methods by two indicators: MAP, and recall. Our experiment results show that tackling hyphenation improves information retrieval significantly. In addition, synonym expansion also enhances IR performance when the focus of a query is identified. For a natural language query, deep semantic analysis and more knowledge-oriented expansion should be applied.

原文???core.languages.en_GB???
期刊NIST Special Publication
出版狀態已出版 - 2005
事件14th Text REtrieval Conference, TREC 2005 - Gaithersburg, MD, United States
持續時間: 15 11月 200518 11月 2005

指紋

深入研究「Enhance genomic IR with term variation and expansion: Experiences of the IASL group at genomic track 2005」主題。共同形成了獨特的指紋。

引用此