Query and tag translation for Chinese-Korean cross-language social media retrieval

Yu Chun Wang, Jian Ting Chen, Richard Tzong Han Tsai, Wen Lian Hsu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Collaborative tagging has been widely adopted by social media websites to allow users to describe content with metadata tags. Tagging can greatly improve search results. We propose a cross-language social media retrieval system (CLSMR) to help users retrieve foreign-language tagged media content. We construct a Chinese to Korean CLSMR system that translates Chinese queries into Korean, retrieves content, and then translates the Korean tags in the search results back into Chinese. Our system translates NEs using a dictionary of bilingual NE pairs from Wikipedia and a pattern-based software translator which learns regular NE patterns from the web. The top-10 precision of YouTube retrieved results for our system was 0.39875. The K-C NE tag translation accuracy for the top-10 YouTube results was 77.6%, which shows that our translation method is fairly effective for named entities. A questionnaire given to users showed that automatically translated tags were considered as informative as a human-written summary. With our proposed CLSMR system, Chinese users can retrieve online Korean media files and get a basic understanding of their content with no knowledge of the Korean language.

Original languageEnglish
Title of host publicationProceedings of the 2011 IEEE International Conference on Information Reuse and Integration, IRI 2011
Pages288-291
Number of pages4
DOIs
StatePublished - 2011
Event12th IEEE International Conference on Information Reuse and Integration, IRI 2011 - Las Vegas, NV, United States
Duration: 3 Aug 20115 Aug 2011

Publication series

NameProceedings of the 2011 IEEE International Conference on Information Reuse and Integration, IRI 2011

Conference

Conference12th IEEE International Conference on Information Reuse and Integration, IRI 2011
Country/TerritoryUnited States
CityLas Vegas, NV
Period3/08/115/08/11

Fingerprint

Dive into the research topics of 'Query and tag translation for Chinese-Korean cross-language social media retrieval'. Together they form a unique fingerprint.

Cite this