Enhance genomic IR with term variation and expansion: Experiences of the IASL group at genomic track 2005

Tzong Han Tsai, Chia Wei Wu, Hsieh Chuan Hung, Yu Chun Wang, Ding He, Yi Feng Lin, Cheng Wei Lee, Ting Yi Sung, Wen Lian Hsu

Research output: Contribution to journalConference articlepeer-review

Abstract

The rapid increase of biomedical literature available on the web has made it increasingly difficult to find precise information. To implement an accurate biomedical information retrieval (IR) system, we must deal with the variants of biomedical terms carefully. In this paper, we focus on the generation of aliases, synonyms, acronyms, and lexical variants of such terms. In addition, we also propose a hyphen handling technique for processing hyphenated terms. We use the original terms/phrases, and expanded terms/phrases to construct an Indri query, and evaluate the effectiveness of various methods by two indicators: MAP, and recall. Our experiment results show that tackling hyphenation improves information retrieval significantly. In addition, synonym expansion also enhances IR performance when the focus of a query is identified. For a natural language query, deep semantic analysis and more knowledge-oriented expansion should be applied.

Original languageEnglish
JournalNIST Special Publication
StatePublished - 2005
Event14th Text REtrieval Conference, TREC 2005 - Gaithersburg, MD, United States
Duration: 15 Nov 200518 Nov 2005

Keywords

  • Biomedical literature
  • Information retrieval
  • Lexical variation
  • Query expansion

Fingerprint

Dive into the research topics of 'Enhance genomic IR with term variation and expansion: Experiences of the IASL group at genomic track 2005'. Together they form a unique fingerprint.

Cite this