TY - CONF
T1 - The construction of a Chinese named entity tagged corpus
T2 - CNEC1.0
AU - Shih, Cheng Wei
AU - Tsai, Tzong Han
AU - Wu, Shih Hung
AU - Hsieh, Chiu Chen
AU - Hsu, Wen Lian
N1 - Publisher Copyright: © Proc. of the 16th Conference on Computational Linguistics and Speech Processing, ROCLING 2004. All rights reserved.
PY - 2004
Y1 - 2004
N2 - In order to build an automatic named entity recognition (NER) system for machine learning, a large tagged corpus is necessary. This paper describes the manual construction of a Chinese named entity tagged corpus (CNEC 1.0) that can be used to improve NER performance. In this project, we define five named entity tags: PER (person name), LOC (location name), ORG (organization name), LAO (location as organization), and OAL (organization as location) for named entity categories. In addition, we propose a special tag, DIFF (Difficulty), to annotate ambiguous cases during corpus construction. A corpus-annotating procedure, a tagging tool, and an original corpus are also introduced. Finally, we demonstrate a part of our manually tagged corpus.
AB - In order to build an automatic named entity recognition (NER) system for machine learning, a large tagged corpus is necessary. This paper describes the manual construction of a Chinese named entity tagged corpus (CNEC 1.0) that can be used to improve NER performance. In this project, we define five named entity tags: PER (person name), LOC (location name), ORG (organization name), LAO (location as organization), and OAL (organization as location) for named entity categories. In addition, we propose a special tag, DIFF (Difficulty), to annotate ambiguous cases during corpus construction. A corpus-annotating procedure, a tagging tool, and an original corpus are also introduced. Finally, we demonstrate a part of our manually tagged corpus.
UR - http://www.scopus.com/inward/record.url?scp=85117174162&partnerID=8YFLogxK
M3 - Conference paper
AN - SCOPUS:85117174162
ER -