Text-based video genre classification using multiple feature categories and categorization methods

dc.contributor.advisorBosch, A.P.J. van den
dc.contributor.advisorHendrickx, I.H.E.
dc.contributor.authorLee, Chris G. van der
dc.date.issued2017-07-13
dc.description.abstractThe aim of this work is to categorize movies into genres using text-based features. Textual, syntactical and content-specific features are extracted from subtitles in the SUBTIEL corpus. The effectiveness of these three feature types is then compared using five algorithms (AdaBoost, C4.5, Naive Bayes, Random Forest, and SVM) and four methods are tested to combine these features (supervector, add-rule meta-classifier, product-rule meta-classifier, algorithm-based meta-classifier). The experimental results show that of the three feature types, the content-specific features result in the most accurate classifier. Furthermore, it is found that the Random Forest and SVM techniques are the two most accurate algorithms and that combining the textual, syntactical and content-specific features results in a more accurate classifier. However, the effectiveness of combining these three classifiers is largely dependent on the combination method: the algorithm-based meta classifier yields the largest improvement over the individual feature type classifiers.en_US
dc.identifier.urihttp://theses.ubn.ru.nl/handle/123456789/5021
dc.language.isoenen_US
dc.thesis.facultyFaculteit der Letterenen_US
dc.thesis.specialisationResearchmaster Language and Communicationen_US
dc.thesis.studyprogrammeResearchmastersen_US
dc.thesis.typeResearchmasteren_US
dc.titleText-based video genre classification using multiple feature categories and categorization methodsen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Chris van der Lee s4000528 ReMA scriptie 2017.pdf
Size:
1.4 MB
Format:
Adobe Portable Document Format