TY - JOUR
T1 - A Semi-automatic Error Retrieval Method for Uncovering Collocation Errors from a Large Learner Corpus
AU - Yang, Christine Ting Yu
AU - Chen, Howard Hao Jan
AU - Liu, Chen Yu
AU - Liu, Yu Cheng
N1 - Publisher Copyright:
© 2019, National Taiwan Normal University.
PY - 2020/3/1
Y1 - 2020/3/1
N2 - Previous studies on ESL/EFL learners’ verb-noun (V-N) miscollocations have shed some light on common miscollocation types and possible causes. However, barriers to further understanding of learners’ difficulties still exist, such as the limited amount of learner data generated from small corpora and the labor-intensive process of manually retrieving collocational errors. To provide researchers with a more efficient retrieval method, this study proposed the use of the Sketch-Diff function in the Sketch Engine (SKE) platform to semi-automatically retrieve collocation errors in large learner corpora. To test the feasibility of this semi-automatic retrieval method, a 7.4-million-word EFL learner corpus was investigated with Sketch-Diff, and 4541 tokens of common miscollocations were identified. Analysis of these miscollocations revealed that most errors were verb-based and often caused by negative transfer from the learners’ L1, undergeneralization (e.g., ignorance of L2 syntactic rules), and approximation (e.g., the misuse of near-synonyms, hyper-/hyponyms, antonyms, and lexemes with similar sound/form). This study demonstrates that using Sketch-Diff to retrieve V-N miscollocations from a large learner corpus is both feasible and efficient. This method can be applied to other languages to further deepen our understanding of L2 learners’ difficulties in collocation acquisition.
AB - Previous studies on ESL/EFL learners’ verb-noun (V-N) miscollocations have shed some light on common miscollocation types and possible causes. However, barriers to further understanding of learners’ difficulties still exist, such as the limited amount of learner data generated from small corpora and the labor-intensive process of manually retrieving collocational errors. To provide researchers with a more efficient retrieval method, this study proposed the use of the Sketch-Diff function in the Sketch Engine (SKE) platform to semi-automatically retrieve collocation errors in large learner corpora. To test the feasibility of this semi-automatic retrieval method, a 7.4-million-word EFL learner corpus was investigated with Sketch-Diff, and 4541 tokens of common miscollocations were identified. Analysis of these miscollocations revealed that most errors were verb-based and often caused by negative transfer from the learners’ L1, undergeneralization (e.g., ignorance of L2 syntactic rules), and approximation (e.g., the misuse of near-synonyms, hyper-/hyponyms, antonyms, and lexemes with similar sound/form). This study demonstrates that using Sketch-Diff to retrieve V-N miscollocations from a large learner corpus is both feasible and efficient. This method can be applied to other languages to further deepen our understanding of L2 learners’ difficulties in collocation acquisition.
KW - Collocation
KW - Error analysis
KW - Error extraction method
KW - Learner corpus
UR - http://www.scopus.com/inward/record.url?scp=85075154320&partnerID=8YFLogxK
U2 - 10.1007/s42321-019-00037-y
DO - 10.1007/s42321-019-00037-y
M3 - 期刊論文
AN - SCOPUS:85075154320
SN - 1023-7267
VL - 44
SP - 1
EP - 19
JO - English Teaching and Learning
JF - English Teaching and Learning
IS - 1
ER -