Reconstructing Images and Audio: Multi-modal autoencoders

van der Linden, N. R. L.

Reconstructing Images and Audio: Multi-modal autoencoders

Files

4716795 Linden vd.pdf (426.5 KB)

Authors

van der Linden, N. R. L.

Issue Date

2020-07-01

Language

en

URI

https://theses.ubn.ru.nl/handle/123456789/12733

Abstract

In this thesis, a multi-modal auto-encoder is built that reconstructs both images and audio. The goal is to build a multi-modal auto-encoder that is capable of learning a shared representation between images of digits and audio of the pronunciation of the digits. This model, while fairly accurate on digits, does not perform very well on the audio data.

Supervisor

Lanillos Pradas, P. L.

Faculty

Faculteit der Sociale Wetenschappen

Programme

Artificial Intelligence

Specialisation

Bachelor Artificial Intelligence

Collections

Faculteit der Sociale Wetenschappen

Full item page

Reconstructing Images and Audio: Multi-modal autoencoders

Keywords

Files

Authors

Issue Date

Language

Document type

Journal Title

Journal ISSN

Volume Title

Publisher

Title

ISSN

Volume

Issue

Startpage

Endpage

URI

DOI

Abstract

Description

Citation

Supervisor

Faculty

Programme

Specialisation

Collections