TY - JOUR
T1 - An attention enhanced sentence feature network for subtitle extraction and summarization
AU - Chootong, Chalothon
AU - Shih, Timothy K.
AU - Ochirbat, Ankhtuya
AU - Sommool, Worapot
AU - Zhuang, Yung Yu
N1 - Publisher Copyright:
© 2021 Elsevier Ltd
PY - 2021/9/15
Y1 - 2021/9/15
N2 - An automatic subtitle summarization of videos not only aims to tackle the problem of content overloading but can also improve the performance of video retrieval, allowing viewers to efficiently access and understand the main content of a video. However, subtitle summarization is a challenging task due to documents being composed of incomplete sentences, meaningless phrases, and informal language. In this paper, we introduce a novel multiple attention mechanism for subtitle summarization to address such issues. We take advantage of both Convolutional Neural Networks (CNNs) and Bidirectional Long Short-Term Memory (Bi-LSTM) Networks to capture the critical information of the sentence that is used to identify the importance of the sentence. Based on the salient sentence score, we introduce the summary generation method to produce a summary of the video. The experiments are conducted on both subtitle documents from educational videos and text documents. To the best of our knowledge, no previous studies have applied multiple-attention mechanisms for summarizing educational videos. Besides, we experiment on two well-known text document datasets, DUC2002, and CNN/Daily Mail, to test the performance of our model. We utilize ROUGE measures for evaluating the generated summaries at 95% confidence intervals. The experimental results demonstrated that our model outperforms the baseline and state-of-the-art models on the ROUGE-1, ROUGE-2, and ROUGE-L scores.
AB - An automatic subtitle summarization of videos not only aims to tackle the problem of content overloading but can also improve the performance of video retrieval, allowing viewers to efficiently access and understand the main content of a video. However, subtitle summarization is a challenging task due to documents being composed of incomplete sentences, meaningless phrases, and informal language. In this paper, we introduce a novel multiple attention mechanism for subtitle summarization to address such issues. We take advantage of both Convolutional Neural Networks (CNNs) and Bidirectional Long Short-Term Memory (Bi-LSTM) Networks to capture the critical information of the sentence that is used to identify the importance of the sentence. Based on the salient sentence score, we introduce the summary generation method to produce a summary of the video. The experiments are conducted on both subtitle documents from educational videos and text documents. To the best of our knowledge, no previous studies have applied multiple-attention mechanisms for summarizing educational videos. Besides, we experiment on two well-known text document datasets, DUC2002, and CNN/Daily Mail, to test the performance of our model. We utilize ROUGE measures for evaluating the generated summaries at 95% confidence intervals. The experimental results demonstrated that our model outperforms the baseline and state-of-the-art models on the ROUGE-1, ROUGE-2, and ROUGE-L scores.
KW - Educational video
KW - Extractive summarization
KW - Integration CNN-LSTM
KW - Multiple attention mechanism
KW - Subtitle summarization
UR - http://www.scopus.com/inward/record.url?scp=85104342823&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2021.114946
DO - 10.1016/j.eswa.2021.114946
M3 - 期刊論文
AN - SCOPUS:85104342823
SN - 0957-4174
VL - 178
JO - Expert Systems with Applications
JF - Expert Systems with Applications
M1 - 114946
ER -