Assessing the Impact of Feature Selection on Neural Network Performance in Audio Dialect Identification

Keywords
Loading...
Thumbnail Image
Issue Date
2024-11-03
Language
en
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
This study aimed to investigate the role of feature selection in the performance of neural networks for audio dialect identification, and to ascertain which features are most effective for capturing dialect-specific information across different dialects and phonemes. While previous research, such as that conducted by Tawaqal and Suyanto (2021), has demonstrated accuracy rates of over 80% with Mel-Frequency Cepstral Coefficients (MFCC), this study found less satisfactory results, with the highest accuracy score reaching 60%. The type of feature vectors that were significant were the number of frames around the vowel nucleus, the number of to be classified dialects and the different vowels. Non-significant type of feature vectors were parameters connected to the network (type, number of layers), inclusion of derivative and MFCC parameters.
Description
Citation
Faculty
Faculteit der Letteren