Cross-language article linking with different knowledge bases using bilingual topic model and translation features

Yu Chun Wang, Chun Kai Wu, Richard Tzong Han Tsai

研究成果: 雜誌貢獻期刊論文同行評審

7 引文 斯高帕斯(Scopus)

摘要

Creating links among online encyclopedia articles in different languages is crucial in the construction and integration of large multilingual knowledge bases. Most research to date has focused on linking among different language versions of Wikipedia, yet other large online encyclopedias in a variety of languages exist. In this work, we present a cross-language article-linking method using a bilingual topic model and translation features based on an SVM model to link articles in English Wikipedia and Chinese Baidu Baike, the most widely used Wiki-like encyclopedia in China. To evaluate our approach, we compile data sets from Baidu Baike articles and their corresponding English Wikipedia articles. The evaluation results show that our approach achieves at most 0.8158 in MRR, outperforming the baseline system by 0.1328 (+19.44%) in MRR. Our method does not heavily depend on linguistic characteristics, and it can be easily extended to generate cross-language article links among different online encyclopedias in other languages.

原文???core.languages.en_GB???
頁(從 - 到)228-236
頁數9
期刊Knowledge-Based Systems
111
DOIs
出版狀態已出版 - 1 11月 2016

指紋

深入研究「Cross-language article linking with different knowledge bases using bilingual topic model and translation features」主題。共同形成了獨特的指紋。

引用此