Heuristic Dynamic Programming Using Echo State Network As Online Trainable Adaptive Critic

Petia Koprinkova-Hristova,Mohamed Oubbati,Guenther Palm

International journal of adaptive control and signal processing（2012）

引用 17|浏览11

暂无评分

摘要

SUMMARYThe present paper proposes an implementation of a relatively new recurrent neural network architecture—the echo state network (ESN)–within the frame of heuristic dynamic programming. The ESN is trained online to estimate the utility function and to adapt the control policy of an embodied agent. With the advantage of an easy training algorithm, the ESN architecture offers a simple way to calculate the derivatives required for adapting the controller. Experimental results are provided to validate the proposed learning approach. Copyright © 2012 John Wiley & Sons, Ltd.

查看译文

关键词

adaptive critic design (ACD),heuristic dynamic programming (HDP),echo state network (ESN)

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要