Neural Networks and Glimpses for Speech-in-Noise Understanding

Peek, Kiara

dc.contributor.advisor	Heijden, van der, Kiki
dc.contributor.advisor	Fitz, H.
dc.contributor.author	Peek, Kiara
dc.date.issued	2022-07-12
dc.description.abstract	Humans use glimpses to identify speech in noise. However, Automatic Speech Recognition (ASR) systems often rely on signal-to-noise ratios (SNRs) as a predictor of speech intelligibility. This research extends the studies by Zhu et al. and Cooke et al. by evaluating the importance of glimpses in noisy environments and the performance of an artificial neural network. The existing wav2vec 2.0 model by Baevski et al. is tested on both clean and noisy speech, followed by an analysis of glimpses. Results show a strong positive correlation between word accuracy and glimpse ratio, which indicates that neural networks rely on glimpses for speech-in-noise understanding. It is also shown that glimpses are a better predictor of word accuracy than signal-to-noise ratios and that glimpses contribute more to speech understanding in non-stationary than in stationary noise types.
dc.identifier.uri	https://theses.ubn.ru.nl/handle/123456789/15984
dc.language.iso	en
dc.thesis.faculty	Faculteit der Sociale Wetenschappen
dc.thesis.specialisation	specialisations::Faculteit der Sociale Wetenschappen::Artificial Intelligence::Bachelor Artificial Intelligence
dc.thesis.studyprogramme	studyprogrammes::Faculteit der Sociale Wetenschappen::Artificial Intelligence
dc.thesis.type	Bachelor
dc.title	Neural Networks and Glimpses for Speech-in-Noise Understanding

Files

Original bundle

Name: Peek. K. s-1018439-BSc-Thesis-2022.pdf
Size: 848.65 KB
Format: Adobe Portable Document Format

Collections

Faculteit der Sociale Wetenschappen