Abstract
In this paper, we describe the construction details of a confused character set for Chinese spell checking. The SIGHAN 2013-2015 bakeoff datasets are adopted to measure the performance of correct character suggestions. Our confusion set significantly outperforms the existing confusion set in candidate selection for automatic spelling checkers.
| Original language | English |
|---|---|
| Title of host publication | ICCE 2019 - 27th International Conference on Computers in Education, Proceedings |
| Editors | Maiga Chang, Hyo-Jeong So, Lung-Hsiang Wong, Fu-Yun Yu, Ju-Ling Shih, Ivica Boticki, Ming-Puu Chen, Ali Dewan, Stian Haklev, Elizabeth Koh, Tomoko Kojiri, Kuo-Chen Li, Daner Sun, Yun Wen |
| Publisher | Asia-Pacific Society for Computers in Education |
| Pages | 703-705 |
| Number of pages | 3 |
| ISBN (Electronic) | 9789869721431 |
| State | Published - 19 Nov 2019 |
| Event | 27th International Conference on Computers in Education, ICCE 2019 - Kenting, Taiwan Duration: 2 Dec 2019 → 6 Dec 2019 |
Publication series
| Name | ICCE 2019 - 27th International Conference on Computers in Education, Proceedings |
|---|---|
| Volume | 1 |
Conference
| Conference | 27th International Conference on Computers in Education, ICCE 2019 |
|---|---|
| Country/Territory | Taiwan |
| City | Kenting |
| Period | 2/12/19 → 6/12/19 |
Keywords
- Chinese spell checking
- Confusion set
- Pronunciation similarity
- Shape similarity
Fingerprint
Dive into the research topics of 'Building a confused character set for Chinese spell checking'. Together they form a unique fingerprint.Projects
- 1 Finished
-
Chinese Knowledge Base Construction and Applications for Medical Healthcare Domain(1/3)
Lee, L.-H. (PI)
1/05/19 → 30/04/20
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver