Reinforcement Learning with History Lists: Solving Partially Observable Decision Processes by Using Short Term Memory - Stephan Timmer - Books - Suedwestdeutscher Verlag fuer Hochschuls - 9783838106212 - April 1, 2009


Name: Reinforcement Learning with History Lists: Solving Partially Observable Decision Processes by Using Short Term Memory
Price: 59.99 EUR
Availability: OutOfStock
Author: Stephan Timmer


Available to order (in stock at the supplier)

Expected delivery: Aug 10 - 18


Reinforcement Learning with History Lists: Solving Partially Observable Decision Processes by Using Short Term Memory

Stephan Timmer

Partially Observable Markov Decision Processes (POMDPs) provide a very general framework for modeling uncertainty in learning environments. In a POMDP setting, the learning agent infers a policy for acting optimally in all possible states of the environment while receiving only observations of these states. The basic idea for coping with partial observability is to incorporate memory into the representation of the policy. Perfect memory is provided by the belief space, i.e., the space of probability distributions over environmental states. However, computing policies defined on the belief space requires considerable prior knowledge about the learning problem and is computationally expensive. Stephan Timmer presents a reinforcement learning algorithm for solving POMDPs based on short-term memory. In contrast to belief states, short-term memory cannot represent optimal policies, but it is far more practical and requires no prior knowledge about the learning problem. It can be shown that the algorithm can also be used to solve large Markov Decision Processes (MDPs) with continuous, multi-dimensional state spaces.
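The contrast drawn in the description, between belief states and short-term memory, can be illustrated with a minimal Python sketch. This is not the algorithm from the book: the names `belief_update` and `HistoryWindow`, and the table layouts `T` and `O`, are assumptions made here for illustration only. The sketch shows only why a belief update needs a known model of the environment while a fixed-length history of recent actions and observations does not.

```python
from collections import deque


def belief_update(belief, action, observation, T, O):
    """One Bayes-filter step over a discrete state space.

    Assumed (hypothetical) model layout, both known in advance:
      T[s][a][s2] = P(s2 | s, a)   transition model
      O[s2][o]    = P(o | s2)      observation model
    """
    n = len(belief)
    unnormalized = [
        O[s2][observation] * sum(belief[s] * T[s][action][s2] for s in range(n))
        for s2 in range(n)
    ]
    total = sum(unnormalized)
    if total == 0:
        raise ValueError("observation has zero probability under the model")
    return [p / total for p in unnormalized]


class HistoryWindow:
    """Fixed-length memory of recent (action, observation) pairs.

    No transition or observation model is required; older pairs are
    simply dropped once the window is full.
    """

    def __init__(self, length):
        self._items = deque(maxlen=length)

    def push(self, action, observation):
        self._items.append((action, observation))

    def key(self):
        # Hashable summary, usable for example as an index into a value table.
        return tuple(self._items)
```

The belief update cannot run without `T` and `O`, which is the prior knowledge the description refers to, whereas `HistoryWindow` only records what the agent itself has done and seen.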

Media	Books Paperback Book (book with soft cover and glued back)
Released	April 1, 2009
ISBN13	9783838106212
Publisher	Suedwestdeutscher Verlag fuer Hochschuls
Pages	160
Dimensions	150 × 220 × 10 mm · 256 g
Language	German