Abstract
This paper describes our Japanese-Chinese cross language information retrieval system. We adopt query-translation approach and employ both a conventional JapaneseChinese bilingual dictionary and Wikipedia to translate query terms. We propose that Wikipedia can be regarded as a good dictionary for named entity translation. According to the nature of Japanese writing system, we propose that query terms should be processed differently based on their written forms. We use an iterative method for weighttuning and term disambiguation, which is based on the PageRank algorithm. When evaluating on the NTCIR-5 test set, our system achieves as high as 0.2217 and 0.2276 in relax MAP (Mean Average Precision) measurement of T-runs and D-runs.
Original language | English |
---|---|
Pages (from-to) | 685-697 |
Number of pages | 13 |
Journal | Journal of Information Science and Engineering |
Volume | 26 |
Issue number | 2 |
State | Published - Mar 2010 |
Keywords
- Iterative term weighting
- Japanese-chinese cross-language information retrieval
- Named entity translation
- Query disambiguation
- Query-translation