The design of Chinese character learning system based on phonetic components

Chia Hui Chang, Wen Pen Wu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

An increasing number of people learn Chinese as second language in the world. About 60% of Chinese characters are picto-phonetic compounds which are composed of a phonetic component (PC) and semantic component. Therefore, one can make a guess at a character's pronunciation and meaning from its phonetic and semantic component for a new character. For this reason, we propose an order of phonetic components based on pronunciation strength, frequency and number of strokes for efficient learning with proper pronunciation rules and graph recognition. We adopt stem-deriving instructional method which extends each phonetic component with different radical component to derive new picto-phonetic compounds of similar pronunciation. Via simulation, the top 400 phonetic components and their picto-phonetic extensions are enough for the recognition of 60% characters in general articles; and top 800 phonetic components can help recognition of 90% characters of general news articles.

Original languageEnglish
Title of host publicationProceedings of the 24th Conference on Computational Linguistics and Speech Processing, ROCLING 2012
Pages112-124
Number of pages13
StatePublished - 2012
Event24th Conference on Computational Linguistics and Speech Processing, ROCLING 2012 - Chung-Li, Taiwan
Duration: 21 Sep 201222 Sep 2012

Publication series

NameProceedings of the 24th Conference on Computational Linguistics and Speech Processing, ROCLING 2012

Conference

Conference24th Conference on Computational Linguistics and Speech Processing, ROCLING 2012
Country/TerritoryTaiwan
CityChung-Li
Period21/09/1222/09/12

Keywords

  • Component
  • Phonetic component
  • Picto-phonetic compounds
  • Stem-deriving instructional method

Fingerprint

Dive into the research topics of 'The design of Chinese character learning system based on phonetic components'. Together they form a unique fingerprint.

Cite this