TY - JOUR
T1 - Effectively learn how to learn
T2 - a novel few-shot learning with meta-gradient memory
AU - Hui, Lin
AU - Chen, Yi Cheng
N1 - Publisher Copyright:
© 2024 Inderscience Enterprises Ltd.
PY - 2024
Y1 - 2024
N2 - Recently, the importance of few-shot learning has tremendously grown due to its widespread applicability. Via few-shot learning, users can train their models with few data and maintain high generalisation ability. Meta-learning and continual learning models have demonstrated elegant performance in model development. However, unstable performance and catastrophic forgetting are still two fatal issues with regard to retaining the memory of knowledge about previous tasks when facing new tasks. In this paper, a novel method, enhanced model-agnostic meta-learning (EN-MAML), is proposed for blending the flexible adaptation characteristics of meta-learning and the stable performance of continual learning to tackle the above problems. Based on the proposed learning method, users can efficiently and effectively train the model in a stable manner with few data. Experiments show that when following the N-way K-shot experimental protocol, EN-MAML has higher accuracy, more stable performance and faster convergence than other state-of-the-art models on several real datasets.
AB - Recently, the importance of few-shot learning has tremendously grown due to its widespread applicability. Via few-shot learning, users can train their models with few data and maintain high generalisation ability. Meta-learning and continual learning models have demonstrated elegant performance in model development. However, unstable performance and catastrophic forgetting are still two fatal issues with regard to retaining the memory of knowledge about previous tasks when facing new tasks. In this paper, a novel method, enhanced model-agnostic meta-learning (EN-MAML), is proposed for blending the flexible adaptation characteristics of meta-learning and the stable performance of continual learning to tackle the above problems. Based on the proposed learning method, users can efficiently and effectively train the model in a stable manner with few data. Experiments show that when following the N-way K-shot experimental protocol, EN-MAML has higher accuracy, more stable performance and faster convergence than other state-of-the-art models on several real datasets.
KW - continual learning
KW - deep learning
KW - machine learning
KW - meta-learning
UR - http://www.scopus.com/inward/record.url?scp=85188862342&partnerID=8YFLogxK
U2 - 10.1504/IJWGS.2024.137549
DO - 10.1504/IJWGS.2024.137549
M3 - 期刊論文
AN - SCOPUS:85188862342
SN - 1741-1106
VL - 20
SP - 3
EP - 24
JO - International Journal of Web and Grid Services
JF - International Journal of Web and Grid Services
IS - 1
ER -