Japanese-Chinese information retrieval with an iterative weighting scheme

Chu Cheng Lin, Yu Chun Wang, Richard Tzong Han Tsai

Research output: Contribution to journal › Article › peer-review

3 Scopus citations

Abstract

This paper describes our Japanese-Chinese cross-language information retrieval system. We adopt a query-translation approach and employ both a conventional Japanese-Chinese bilingual dictionary and Wikipedia to translate query terms. We propose that Wikipedia can serve as a good dictionary for named entity translation. Given the nature of the Japanese writing system, we propose that query terms should be processed differently according to their written forms. We use an iterative method, based on the PageRank algorithm, for weight tuning and term disambiguation. Evaluated on the NTCIR-5 test set, our system achieves relax MAP (Mean Average Precision) scores of up to 0.2217 on T-runs and 0.2276 on D-runs.
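For readers unfamiliar with the MAP figure quoted above, the following is a minimal sketch of the standard Mean Average Precision computation. It is not taken from the paper or from the NTCIR evaluation tools. The function names and the document identifiers are hypothetical, and the set of relevant documents is simply an input, so the choice of relevance judgments (for example the relax assessment) is outside the sketch.

```python
from typing import Iterable, Sequence, Set, Tuple


def average_precision(ranked: Sequence[str], relevant: Set[str]) -> float:
    """Average precision for one query.

    Sums precision@k at every rank k that holds a relevant document,
    then divides by the total number of relevant documents.
    """
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for k, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / k
    return precision_sum / len(relevant)


def mean_average_precision(
    runs: Iterable[Tuple[Sequence[str], Set[str]]]
) -> float:
    """Mean of the per-query average precision over all queries."""
    runs = list(runs)
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


# Hypothetical toy data: two queries with made-up document ids.
toy_runs = [
    (["d1", "d2", "d3", "d4"], {"d1", "d3"}),  # AP = (1/1 + 2/3) / 2
    (["d5", "d6"], {"d6"}),                    # AP = (1/2) / 1
]
toy_map = mean_average_precision(toy_runs)
```

Relevant documents that never appear in the ranking still count in the denominator, which is why a system is penalised for missing them.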

Original language: English
Pages (from-to): 685-697
Number of pages: 13
Journal: Journal of Information Science and Engineering
Volume: 26
Issue number: 2
State: Published - Mar 2010

Keywords

  • Iterative term weighting
  • Japanese-Chinese cross-language information retrieval
  • Named entity translation
  • Query disambiguation
  • Query-translation
