Generating patterns for extracting Chinese-Korean named entity translations from the Web

Chih Hao Yeh, Wei Chi Tsai, Yu Chun Wang, Richard Tzong Han Tsai

Research output: Contribution to conference › Paper › peer-review

Abstract

One of the main difficulties in Chinese-Korean cross-language information retrieval is translating the named entities (NEs) in queries. Unlike common words, most NEs are not found in bilingual dictionaries. This paper presents a pattern-based method for finding NE translations on the Web. The most important feature of our system is that patterns are generated and weighted automatically, saving considerable human effort. Our experimental data consist of 160 Chinese-Korean NE pairs selected from Wikipedia in five domains. Our approach achieves a mean average precision (MAP) of 0.84, which demonstrates the system's practicability.

Original language: English
Pages: 366-372
Number of pages: 7
State: Published - 2008
Event: 20th Conference on Computational Linguistics and Speech Processing, ROCLING 2008 - Taipei, Taiwan
Duration: 4 Sep 2008 to 5 Sep 2008

Conference

Conference: 20th Conference on Computational Linguistics and Speech Processing, ROCLING 2008
Country/Territory: Taiwan
City: Taipei
Period: 4/09/08 to 5/09/08

Keywords

  • Chinese-Korean named entity translation
  • Pattern
  • The Web
