Reconstructing Images and Audio: Multi-modal autoencoders

Keywords
No Thumbnail Available
Date
2020-07-01
Language
en
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
In this thesis, a multi-modal auto-encoder is built that reconstructs both images and audio. The goal is to build a multi-modal auto-encoder that is capable of learning a shared representation between images of digits and audio of the pronunciation of the digits. This model, while fairly accurate on digits, does not perform very well on the audio data.
Description
Citation
Faculty
Faculteit der Sociale Wetenschappen