TY - JOUR
T1 - Predicting students’ academic performance using multiple linear regression and principal component analysis
AU - Yang, Stephen J.H.
AU - Lu, Owen H.T.
AU - Huang, Anna Yu Qing
AU - Huang, Jeff Cheng Hsu
AU - Ogata, Hiroaki
AU - Lin, Albert J.Q.
N1 - Publisher Copyright:
© 2018 Information Processing Society of Japan.
PY - 2018/1
Y1 - 2018/1
N2 - With the rise of big data analytics, learning analytics has become a major trend for improving the quality of education. Learning analytics is a methodology for helping students to succeed in the classroom; the principle is to predict student’s academic performance at an early stage and thus provide them with timely assistance. Accordingly, this study used multiple linear regression (MLR), a popular method of predicting students’ academic performance, to establish a prediction model. Moreover, we combined MLR with principal component analysis (PCA) to improve the predictive accuracy of the model. TraditionalMLR has certain drawbacks; specifically, the coefficient of determination (R2) and mean square error (MSE) measures and the quantile-quantile plot (Q-Q plot) technique cannot evaluate the predictive performance and accuracy of MLR. Therefore, we propose predictive MSE (pMSE) and predictive mean absolute percentage correction (pMAPC) measures for determining the predictive performance and accuracy of the regression model, respectively. Analysis results revealed that the proposed model for predicting students’ academic performance could obtain optimal pMSE and pMAPC values by using six components obtained from PCA.
AB - With the rise of big data analytics, learning analytics has become a major trend for improving the quality of education. Learning analytics is a methodology for helping students to succeed in the classroom; the principle is to predict student’s academic performance at an early stage and thus provide them with timely assistance. Accordingly, this study used multiple linear regression (MLR), a popular method of predicting students’ academic performance, to establish a prediction model. Moreover, we combined MLR with principal component analysis (PCA) to improve the predictive accuracy of the model. TraditionalMLR has certain drawbacks; specifically, the coefficient of determination (R2) and mean square error (MSE) measures and the quantile-quantile plot (Q-Q plot) technique cannot evaluate the predictive performance and accuracy of MLR. Therefore, we propose predictive MSE (pMSE) and predictive mean absolute percentage correction (pMAPC) measures for determining the predictive performance and accuracy of the regression model, respectively. Analysis results revealed that the proposed model for predicting students’ academic performance could obtain optimal pMSE and pMAPC values by using six components obtained from PCA.
KW - Learning analytics
KW - Multiple linear regression
KW - Principal component analysis
UR - http://www.scopus.com/inward/record.url?scp=85042135131&partnerID=8YFLogxK
U2 - 10.2197/ipsjjip.26.170
DO - 10.2197/ipsjjip.26.170
M3 - 期刊論文
AN - SCOPUS:85042135131
SN - 0387-5806
VL - 26
SP - 170
EP - 176
JO - Journal of Information Processing
JF - Journal of Information Processing
ER -