Developing learner corpus annotation for Chinese grammatical errors

Lung Hao Lee, Li Ping Chang, Yuen Hsien Tseng

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

12 Citations (Scopus)

Abstract

This study describes the construction of the TOCFL (Test of Chinese as a Foreign Language) learner corpus, including the collection and grammatical error annotation of 2,837 essays written by Chinese language learners with a total of 46 different mother-tongue languages. We propose hierarchical tagging sets to manually annotate grammatical errors, yielding 33,835 annotated instances of inappropriate usage. The corpus has been provided for the shared tasks on Chinese grammatical error diagnosis, which demonstrates the usability of our learner corpus annotation.

Original language: English
Title of host publication: Proceedings of the 2016 International Conference on Asian Language Processing, IALP 2016
Editors: Minghui Dong, Chung-Hsien Wu, Yanfeng Lu, Haizhou Li, Yuen-Hsien Tseng, Liang-Chih Yu, Lung-Hao Lee
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 254-257
Number of pages: 4
ISBN (Electronic): 9781509009213
DOIs
Publication status: Published - 10 Mar 2017
Event: 20th International Conference on Asian Language Processing, IALP 2016 - Tainan, Taiwan
Duration: 21 Nov 2016 to 23 Nov 2016

Publication series

Name: Proceedings of the 2016 International Conference on Asian Language Processing, IALP 2016

Conference

Conference: 20th International Conference on Asian Language Processing, IALP 2016
Country/Territory: Taiwan
City: Tainan
Period: 21/11/16 to 23/11/16