Reconstructing Speech Input from Convolutional Neural Network Activity

dc.contributor.advisor: Gerven, M.A.J. van
dc.contributor.advisor: Güçlü, U.
dc.contributor.author: Churchman, T.J.
dc.date.issued: 2015-07-13
dc.description.abstract: Convolutional Neural Networks (CNNs) applied to the auditory domain have achieved strong results. However, little research has been done to uncover the underlying mechanisms that allow auditory CNNs to perform so well. This exploratory study attempts to help uncover these mechanisms by using the activation patterns a CNN generates for speech inputs to reconstruct those inputs. The best results are obtained by training a decoder that maps activity patterns to a preliminary reconstruction and then fine-tuning that reconstruction through further backpropagation (a code sketch of this two-stage approach follows the metadata fields below). The reconstructions show that the network preserves a good representation of the input up to and including the fully connected units. Encouragingly, this representation appears to be suited to speech specifically, rather than to audio in general. Furthermore, the network proves insensitive to input intensity as well as to the input's activation scale in the time domain. Further research is required to uncover more properties of CNNs applied to the auditory domain.
dc.identifier.uri: http://theses.ubn.ru.nl/handle/123456789/259
dc.language.iso: en
dc.thesis.faculty: Faculteit der Sociale Wetenschappen
dc.thesis.specialisation: Bachelor Artificial Intelligence
dc.thesis.studyprogramme: Artificial Intelligence
dc.thesis.type: Bachelor
dc.title: Reconstructing Speech Input from Convolutional Neural Network Activity
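The two-stage method described in the abstract, first decoding CNN activations to a preliminary reconstruction and then refining that reconstruction through further backpropagation, can be illustrated with a minimal sketch. The code below is not the thesis implementation (the 2015 work predates this API); it assumes a PyTorch setup in which encoder is a pretrained audio CNN that maps spectrograms to activation vectors and decoder is a trainable inverse network. All names and hyperparameters here are illustrative assumptions.

    # Minimal sketch (assumed PyTorch setup, not the thesis code).
    import torch
    import torch.nn as nn

    def train_decoder(encoder, decoder, spectrograms, epochs=10, lr=1e-3):
        """Stage 1: learn a mapping from CNN activations back to the input,
        yielding a preliminary reconstruction. The encoder stays frozen."""
        opt = torch.optim.Adam(decoder.parameters(), lr=lr)
        loss_fn = nn.MSELoss()
        for _ in range(epochs):
            for x in spectrograms:             # x: (batch, 1, freq, time)
                with torch.no_grad():
                    acts = encoder(x)          # the CNN's activation pattern
                recon = decoder(acts)          # preliminary reconstruction
                loss = loss_fn(recon, x)
                opt.zero_grad()
                loss.backward()
                opt.step()
        return decoder

    def fine_tune_reconstruction(encoder, recon_init, target_acts, steps=200, lr=1e-2):
        """Stage 2: refine the reconstruction itself (not the network weights)
        by gradient descent, so that the encoder's activations on the
        reconstruction match the target activations."""
        encoder.eval()
        for p in encoder.parameters():
            p.requires_grad_(False)            # only the reconstruction is optimised
        recon = recon_init.clone().detach().requires_grad_(True)
        opt = torch.optim.Adam([recon], lr=lr)
        loss_fn = nn.MSELoss()
        for _ in range(steps):
            loss = loss_fn(encoder(recon), target_acts)
            opt.zero_grad()
            loss.backward()
            opt.step()
        return recon.detach()

In this sketch, the second stage starts from the decoder's output for a given speech segment and adjusts the reconstruction rather than any weights, which is one way to realise the "fine-tuning through further backpropagation" the abstract describes.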
Files
Original bundle
Name: Churchman, T.,_BA_Thesis_2015.pdf
Size: 719.73 KB
Format: Adobe Portable Document Format
Description: Thesis text (Scriptietekst)