Collective web-based parenthetical translation extraction using markov logic networks

Research output: Contribution to journalArticlepeer-review

Abstract

Parenthetical translations are translations of terms in otherwise monolingual text that appear inside parentheses. Parenthetical translations extraction (PTE) is the task of extracting parenthetical translations from natural language documents. One of the main difficulties in PTE is to detect the left boundary of the translated term in preparenthetical text. In this article, we propose a collective approach that employs Markov logic to model multiple constraints used in the PTE task. We show how various constraints can be formulated and combined in a Markov logic network (MLN). Our experimental results show that the proposed collective PTE approach significantly outperforms a current state-of-the-art method, improving the average F-measure up to 27.11% compared to the previous word alignment approach. It also outperforms an individual MLN-based system by 8.2% and a system based on conditional random fields by 5.9%.

Original languageEnglish
Article number7
JournalACM Transactions on Asian and Low-Resource Language Information Processing
Volume15
Issue number2
DOIs
StatePublished - Dec 2015

Keywords

  • Entity translation
  • Markov logic network
  • Named entity translation
  • Parenthetical translation extraction

Cite this