Details
Article: Optimal Control Using the Transport Equation: The Liouville Machine
By: Kwee, Ivo; Schmidhuber, Jurgen
Type: Article from Journal - e-Journal
In collection: Adaptive Behavior vol. 9 no. 2 (Jun. 2001), pages 105–118.
Topics: machine learning; shortest path; optimal control; transport theory
Fulltext: 105.pdf (531.58KB)
Abstract: Transport theory describes the scattering behavior of physical particles such as photons. Here we show how to connect this theory to optimal control theory and to adaptive behavior of agents embedded in an environment. Environments and tasks are defined by physical boundary conditions. Given some task, we compute a set of probability densities over continuous state, action, and time. From these densities we derive an optimal policy such that for all states the most likely action maximizes the probability of reaching a predefined goal state. Liouville’s conservation theorem tells us that the conditional density at time t, state s, and action a must equal the density at t + dt, s + ds, a + da. Discretization yields a linear system that can be solved directly and whose solution corresponds to an optimal policy. Discounted reward schemes are incorporated naturally by taking the Laplace transform of the equations. The Liouville machine quickly solves rather complex maze problems.
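
Illustrative sketch (not from the article): the abstract's idea of "discretize, solve a linear system directly, read off a policy" can be loosely mimicked on a small grid maze. The code below is not the authors' Liouville machine. It swaps the transport equation for a plain random-walk model, and the discount factor GAMMA only stands in, by loose analogy, for the role the abstract gives to the Laplace transform. The maze layout, GAMMA, and all function names are hypothetical and chosen purely for illustration.

```python
# Hypothetical toy maze: '#' wall, 'S' start, 'G' goal, '.' free cell.
MAZE = [
    "#######",
    "#S..#G#",
    "#.#.#.#",
    "#.#...#",
    "#######",
]
ACTIONS = {"N": (-1, 0), "S": (1, 0), "W": (0, -1), "E": (0, 1)}
GAMMA = 0.9  # assumed discount factor, not a value from the article


def free_cells(maze):
    """Return the coordinates of all non-wall cells."""
    return [(r, c) for r, row in enumerate(maze)
            for c, ch in enumerate(row) if ch != "#"]


def solve_linear(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for k in range(col, n + 1):
                M[r][k] -= f * M[col][k]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def reach_values(maze, gamma=GAMMA):
    """Discounted goal-reaching value of a uniform random walker per cell.

    Each non-goal cell satisfies h(s) = gamma * mean over the four moves
    of h(next); a move into a wall leaves the walker in place. The goal
    cell is a boundary condition with h = 1.
    """
    cells = free_cells(maze)
    index = {cell: i for i, cell in enumerate(cells)}
    n = len(cells)
    A = [[0.0] * n for _ in range(n)]
    b = [0.0] * n
    for (r, c), i in index.items():
        A[i][i] = 1.0
        if maze[r][c] == "G":
            b[i] = 1.0
            continue
        for dr, dc in ACTIONS.values():
            j = index.get((r + dr, c + dc), i)  # wall: stay in place
            A[i][j] -= gamma / len(ACTIONS)
    return dict(zip(cells, solve_linear(A, b)))


def greedy_policy(maze, values):
    """For each non-goal cell, pick the move toward the highest-valued neighbor."""
    policy = {}
    for (r, c) in values:
        if maze[r][c] == "G":
            continue
        legal = [a for a, (dr, dc) in ACTIONS.items()
                 if (r + dr, c + dc) in values]
        policy[(r, c)] = max(
            legal,
            key=lambda a: values[(r + ACTIONS[a][0], c + ACTIONS[a][1])],
        )
    return policy
```

A possible usage is `policy = greedy_policy(MAZE, reach_values(MAZE))`, then following `policy` from the start cell. With GAMMA below 1, every non-goal cell has a neighbor with a strictly higher value, so in this toy model the greedy walk cannot cycle and ends at the goal.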