A general and multi-lingual phrase chunking model based on masking method

Yu Chieh Wu, Chia Hui Chang, Yue Shi Lee

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

21 Scopus citations

Abstract

Several phrase chunkers have been proposed in recent years. Some state-of-the-art chunkers achieve better performance by integrating external resources, such as parsers and additional training data, or by combining multiple learners. However, for many languages and domains such external resources are not readily available, and combining multiple learners increases the cost of both training and testing. In this paper, we propose a masking method to improve chunking accuracy. Experimental results show that our chunker outperforms other deep parsers and chunkers. On the CoNLL-2000 data set, our system achieves an F-measure of 94.12. On the base-chunking task, it reaches an F-measure of 92.95. When ported to Chinese, it achieves an F-measure of 92.36 on the base-chunking task. Our chunker is also efficient: chunking a 50K-word document takes about 50 seconds.

Original language: English
Title of host publication: Computational Linguistics and Intelligent Text Processing - 7th International Conference, CICLing 2006, Proceedings
Publisher: Springer Verlag
Pages: 144-155
Number of pages: 12
ISBN (Print): 3540322051, 9783540322054
State: Published - 2006
Event: 7th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2006 - Mexico City, Mexico
Duration: 19 Feb 2006 - 25 Feb 2006

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 3878 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 7th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2006
Country/Territory: Mexico
City: Mexico City
Period: 19/02/06 - 25/02/06
