Detail
Article: Shifting Attention Using a Temporal Difference Prediction Error and High-Dimensional Input
By: Alexander, William H.
Type: Article from Journal - e-Journal
In collection: Adaptive Behavior vol. 15 no. 2 (Jul. 2007), pages 121–133.
Topics: attention; temporal difference learning; reinforcement learning; representation; adaptive behavior
Fulltext: 121.pdf (637.52KB)
Article content: Research on reinforcement learning has increasingly focused on the role of neuromodulatory systems implicated in associative learning. Formulations of temporal difference (TD) learning have gained a great deal of attention due to the similarity of the TD prediction error and the observed activity of dopamine neurons in the primate midbrain. Recent work has attempted to integrate additional neuromodulatory systems such as noradrenaline and acetylcholine in a TD framework. Additional work has been done to remedy representational issues arising from TD variants that result in incorrect predictions of dopamine activity, as well as to incorporate the TD error signal in models of categorization. In this paper, an actor–critic model incorporating aspects of TD learning and psychological models of attention is described. The development of the model and the behavior of an autonomous agent in a simulated environment are examined and compared with a variant of TD learning lacking an attentional component. The agent learns to behave adaptively due to the shifting of attention to relevant aspects of a high-dimensional input. In contrast, the TD model exhibits perseverative behavior and comparatively slow learning in the same context. It is suggested that real-time models of attention may provide insight into neuromodulatory systems implicated in attention and representational learning.