THE EFFICIENCY OF ADVERSARIAL STATE EMBEDDINGS IN MODEL-BASED RL TASKS THE EFFECTS OF USING LATENT SPACES OF ADVERSARIAL NETWORKS TO ENCODE VISUAL INPUT FOR THE WORLD MODELS ARCHITECTURE

Keywords
Loading...
Thumbnail Image
Issue Date
2020-02-12
Language
en
Document type
Journal Title
Journal ISSN
Volume Title
Publisher
Title
ISSN
Volume
Issue
Startpage
Endpage
DOI
Abstract
Unravelling all the information received from our surroundings is key to understand and interact with the outside world. The outcome of this operation determines the performance of any other step in the reasoning processes, thus making it a crucial element of any agent's learning experience. This thesis explored how di erent representations of visual information a ected the ability of the World Models architecture to nd an optimal policy using online, o ine and hybrid training procedures. To that end, we replaced its original perception module with other alternatives that, with the same neural architecture, imposed different biases and de ned di erent elements of interest. We attempted to promote disentanglement using -VAE, and let the model de ne important high-level features in an adversarial fashion using VAE-GAN. We proved that VAE-GAN can be an alternative to traditional autoencoders when encoding visual input in a reinforcement learning setup. Not only that, but this technique improved the nal performance of several of our con gurations. To the best of our knowledge, this was the rst piece of work that has ever used an adversarial architecture to encode sensory input for a reinforcement learning task. Additionally, we were able to test theWorld Models architecture on a new training procedure that alternated both training in the real world and inside the model's imagination. Unfortunately, we could not directly encourage the creation of disentangled latent spaces under the current con guration, but we still provided a qualitative analysis of this characteristic for all our approaches. Keywords Visual embeddings; knowledge representation; model-based reinforcement learning; generative adversarial networks; World Models; VAE; VAE-GAN
Description
Citation
Faculty
Faculteit der Sociale Wetenschappen