Apparent Personality Prediction using Multimodal Residual Networks with 3D Convolution

Keywords
Loading...
Thumbnail Image
Authors
Issue Date
2018-07-01
Language
nl
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
In this thesis we propose a 3D apparent personality prediction model as extension of the multimodal residual neural network used for first impression analysis by Güçlütürk et al. [1]. The original model was trained on audio-visual data from YouTube videos and predicts the Big Five personality traits of the people in the video. The auditory data and the visual data were randomly selected within a clip, and thus not synchronized. The novel contribution of this research is to study the effect of extending the visual information over multiple frames, and of synchronizing the two modalities on the performance of the model. The model architecture was adapted to include these changes, and several new models were trained. Each performed better than the baseline models trained on the same dataset. Moreover, we provide evidence that temporal information improves the performance. However, a different network architecture is needed to prove the effect of the synchronization.
Description
Citation
Faculty
Faculteit der Sociale Wetenschappen