Memory Reduction through Experience Classification f or Deep Reinforcement Learning with Prioritized Experience Replay

Kai Huan Shen, Pei Yun Tsai

研究成果: 書貢獻/報告類型會議論文篇章同行評審

2 引文 斯高帕斯(Scopus)

摘要

Prioritized experience replay has been widely used in many online reinforcement learning algorithms, providing high efficiency in exploiting past experiences. However, a large replay buffer consumes system storage significantly. Thus, in this paper, a segmentation and classification scheme is proposed. The distribution of temporal-difference errors (TD errors) is first segmented. The experience for network training is classified according to its updated TD error. Then, a swap mechanism for similar experiences is implemented to change the lifetimes of experiences in the replay buffer. The proposed scheme is incorporated in the Deep Deterministic Policy Gradient (DDPG) algorithm, and the Inverted Pendulum and Inverted Double Pendulum tasks are used for verification. From the experiments, our proposed mechanism can effectively remove the buffer redundancy and further reduce the correlation of experiences in the replay buffer. Thus, better learning performance with reduced memory size is achieved at the cost of additional computations of updated TD errors.

原文???core.languages.en_GB???
主出版物標題2019 IEEE International Workshop on Signal Processing Systems, SiPS 2019
發行者Institute of Electrical and Electronics Engineers Inc.
頁面166-171
頁數6
ISBN(電子)9781728119274
DOIs
出版狀態已出版 - 10月 2019
事件33rd IEEE International Workshop on Signal Processing Systems, SiPS 2019 - Nanjing, China
持續時間: 20 10月 201923 10月 2019

出版系列

名字IEEE Workshop on Signal Processing Systems, SiPS: Design and Implementation
2019-October
ISSN(列印)1520-6130

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???33rd IEEE International Workshop on Signal Processing Systems, SiPS 2019
國家/地區China
城市Nanjing
期間20/10/1923/10/19

指紋

深入研究「Memory Reduction through Experience Classification f or Deep Reinforcement Learning with Prioritized Experience Replay」主題。共同形成了獨特的指紋。

引用此