Anda belum login :: 23 Nov 2024 23:18 WIB
Detail
BukuEvolution and Learning: Evolving Sensors in a Simple MDP Environment
Bibliografi
Author: Jung, Tobias ; Dauscher, Peter (Co-Author); Uthmann, Thomas (Co-Author)
Topik: sensor evolution; adaptive; relevant information; cognitive economy; state generalization; reinforcement learning
Bahasa: (EN )    
Penerbit: SAGE Publications     Tempat Terbit: London    Tahun Terbit: 2003    
Jenis: Article - untuk jurnal ilmiah
Fulltext: 179AB113.pdf (628.0KB; 0 download)
Abstract
Natural intelligence and autonomous agents face difficulties when acting in information-dense environments. Assailed by a multitude of stimuli they have to make sense of the inflow of information, filtering and processing what is necessary, but discarding that which is unimportant. This paper aims at investigating the interactions between evolution of the sensorial channel extracting the information from the environment and the simultaneous individual adaptation of agent-control. Our particular goal is to study the influence of learning on the evolution of sensors, with learning duration being the tunable parameter. A genetic algorithm governs the evolution of sensors appropriate for the agent solving a simple grid world task. The performance of the agent is taken as fitness; ?sensors? are conceived as a map from environmental states to agent observations, and individual adaptation is modeled by Qlearning. Our experimental results show that due to the principles of cognitive economy learning and varying the degree thereof actually transforms the fitness landscape. In particular we identify a tradeoff between learning speed (load) and sensor accuracy (error). These results are further reinforced by theoretical analysis: we derive an analytical measure for the quality of sensors based on the mutual entropy between the system of states and the selection of an optimal action, a concept recently proposed by Polani, Martinetz, and Kim.
Opini AndaKlik untuk menuliskan opini Anda tentang koleksi ini!

Lihat Sejarah Pengadaan  Konversi Metadata   Kembali
design
 
Process time: 0.203125 second(s)