TY - JOUR
T1 - An automated learning content classification model for open education repositories
T2 - Case of MERLOT II
AU - Gunarathne, W. K.T.M.
AU - Shih, Timothy K.
AU - Chootong, Chalothon
AU - Sommool, Worapot
AU - Ochirbat, Ankhtuya
N1 - Publisher Copyright:
© 2020 Taiwan Academic Network Management Committee. All rights reserved.
PY - 2020
Y1 - 2020
N2 - The value of OERs mainly depends on how easy they can be searched or located through a web search engine. Currently, the MERLOT II metadata repository requests resource providers to choose the relevant discipline category manually while adding material to its repository. This practice appears very time-consuming and also bound to involve human errors. If a member picks an incorrect discipline category, then the learning resource may not be correctly categorized in the repository. This situation may result in a learning resource to be not shortlisted for a given keyword search of the “MERLOT Smart Search” or in the “Advanced search.” Above investigations motivated us to recognize the importance of developing an automated learning content classification solution for OER repositories. In this study, we proposed a novel automated learning content classification model (LCCM) to classify learning resources into relevant discipline categories while adding them to the MERLOT repository. The research goal incorporated in this paper include dataset preparation, data preprocessing, feature extraction using LDA topic model, and calculating the semantic similarity using a pre-trained word embedding matrix. These methods serve as a base for classifying learning resources more effectively within a short time.
AB - The value of OERs mainly depends on how easy they can be searched or located through a web search engine. Currently, the MERLOT II metadata repository requests resource providers to choose the relevant discipline category manually while adding material to its repository. This practice appears very time-consuming and also bound to involve human errors. If a member picks an incorrect discipline category, then the learning resource may not be correctly categorized in the repository. This situation may result in a learning resource to be not shortlisted for a given keyword search of the “MERLOT Smart Search” or in the “Advanced search.” Above investigations motivated us to recognize the importance of developing an automated learning content classification solution for OER repositories. In this study, we proposed a novel automated learning content classification model (LCCM) to classify learning resources into relevant discipline categories while adding them to the MERLOT repository. The research goal incorporated in this paper include dataset preparation, data preprocessing, feature extraction using LDA topic model, and calculating the semantic similarity using a pre-trained word embedding matrix. These methods serve as a base for classifying learning resources more effectively within a short time.
KW - Automatic learning object classification
KW - Multi-label classification
KW - Open Educational Resources
KW - Topic models
UR - http://www.scopus.com/inward/record.url?scp=85097151319&partnerID=8YFLogxK
U2 - 10.3966/160792642020092105005
DO - 10.3966/160792642020092105005
M3 - 期刊論文
AN - SCOPUS:85097151319
SN - 1607-9264
VL - 21
SP - 1277
EP - 1288
JO - Journal of Internet Technology
JF - Journal of Internet Technology
IS - 5
ER -