Connecting the Demons: How connection choices of a Horde implementation affect Demon prediction capabilities.

Waa, J.S. van der

Connecting the Demons: How connection choices of a Horde implementation affect Demon prediction capabilities.

Files

Waa van der J,BA_2013.pdf (2.09 MB)

Authors

Waa, J.S. van der

Issue Date

2013-08-26

Language

en

URI

http://theses.ubn.ru.nl/handle/123456789/137

Abstract

The reinforcement learning framework Horde, developed by Sutton et al. [9], is a network of Demons that processes sensorimotor data to general knowledge about the world. These Demons can be connected to each other and to data-streams from specific sensors. This paper will focus on how and if the capability of Demons to learn general knowledge is affected by different numbers of connections with both other Demons and sensors. Several experiments and tests where done and analyzed to map these effects and to provide insight in how these effects arose. Keywords: Artificial Intelligence, value function approximation, temporal difference learning, reinforcement learning, predictions, prediction error, pendulum environment, parallel processing, offpolicy learning, network connections, knowledge representation, Horde Architecture, GQ( ), general value functions

Supervisor

Otterlo, M. van

Sprinkhuizen-Kuyper, I.G.

Faculty

Faculteit der Sociale Wetenschappen

Programme

Artificial Intelligence

Specialisation

Bachelor Artificial Intelligence

Collections

Faculteit der Sociale Wetenschappen

Full item page

Connecting the Demons: How connection choices of a Horde implementation affect Demon prediction capabilities.

Keywords

Files

Authors

Issue Date

Language

Document type

Journal Title

Journal ISSN

Volume Title

Publisher

Title

ISSN

Volume

Issue

Startpage

Endpage

URI

DOI

Abstract

Description

Citation

Supervisor

Faculty

Programme

Specialisation

Collections