Impact of teachers’ grading policy on the identification of at-risk students in learning analytics

Owen H.T. Lu, Anna Y.Q. Huang, Stephen J.H. Yang

研究成果: 雜誌貢獻期刊論文同行評審

19 引文 斯高帕斯(Scopus)


The purpose of learning analytics is to promote student success in the classroom. To implement the framework of learning analytics, researchers have adopted machine learning methodologies to identify at-risk students at an early stage. In theory, machine learning is a mathematical algorithm that improves automation through experience. The experience is the data collected from online learning platforms, and in general, the data contain various features such as the number of times that a student accesses the learning material each week. Relevant studies have demonstrated extremely high accuracy in identifying at-risk students using identification models trained by machine learning. However, numerous details and data challenges have been overlooked in prior studies, calling into question the accuracy of past contributions. In this study, we focused on one type of data challenge: data imbalance. The data imbalance problems in education are usually the result of teachers’ grading policy. To highlight the seriousness of this issue, we collected data from 12 blended learning courses and summarized 3 types of grading policies: discrimination, stringency, and leniency. We then provided evidence that the leniency strategy causes the illusion of high accuracy of at-risk student identification. Finally, we verified a robust method to address the effectiveness of the leniency strategy, and using these results, we summarized the characteristics of students who tend to be misidentified by machine learning methodology.

期刊Computers and Education
出版狀態已出版 - 4月 2021


深入研究「Impact of teachers’ grading policy on the identification of at-risk students in learning analytics」主題。共同形成了獨特的指紋。