Unleashing the Horde

Planting, T.

Unleashing the Horde

Files

Planting, T,Ba-Thesis2012.pdf (924.56 KB)

Authors

Planting, T.

Issue Date

2012-08-31

Language

en

URI

http://theses.ubn.ru.nl/handle/123456789/110

Abstract

Sutton et al. (2011) proposed a new reinforcement learning architecture called Horde. This architecture is based on the parallel processing of sensorimotor data. We have implemented the Horde architecture in our own simulated environment. This environment is a simple continuous MDP. We have done this to investigate if the paper provides enough information to reconstruct the algorithm, which design choices that are not explicitly mentioned in the paper have to be made to reconstruct the architecture, and whether the algorithm generalizes well to a different simulation environment. Several tests were run and the results were analyzed. Keywords: artificial intelligence, knowledge representation, real-time, reinforcement learning, o -policy learning, temporal difference learning, value function approximation, tile coding, general value functions, GQ( ), predictions, parallel processing

Supervisor

Otterlo, M. van

Sprinkhuizen-Kuyper, I.G.

Faculty

Faculteit der Sociale Wetenschappen

Programme

Artificial Intelligence

Specialisation

Bachelor Artificial Intelligence

Collections

Faculteit der Sociale Wetenschappen

Full item page

Unleashing the Horde

Keywords

Files

Authors

Issue Date

Language

Document type

Journal Title

Journal ISSN

Volume Title

Publisher

Title

ISSN

Volume

Issue

Startpage

Endpage

URI

DOI

Abstract

Description

Citation

Supervisor

Faculty

Programme

Specialisation

Collections