Organizing Flickr30k Using Text Clustering

dc.contributor.advisor	Kachergis, G.E.
dc.contributor.advisor	Grootjen, F.A.
dc.contributor.author	Güclü, I.
dc.date.issued	2018-06-18
dc.description.abstract	Text clustering is the process of clustering similar documents together based on the textual information within a document. The captions provided with the Flickr30k dataset will be used to organize the images. The dataset consists of captioned images of everyday life. The two approaches to clustering (hierarchical and partitional) will be implemented to assess the formed clusters. K-means and agglomerative clustering will be used to experiment with. The performance of the two algorithms will be assessed using internal validity measurements. The difference between the two algorithms was too small to judge which one performed better. However the clusters that are formed did differ. K-means made a distinction between ‘adult people’ vs. ‘young people’, agglomerative clustering made a distinction between ‘people’ vs. ‘bullfighting’.	en_US
dc.embargo.lift	10000-01-01
dc.embargo.type	Permanent embargo	en_US
dc.identifier.uri	https://theses.ubn.ru.nl/handle/123456789/7033
dc.language.iso	en	en_US
dc.thesis.faculty	Faculteit der Sociale Wetenschappen	en_US
dc.thesis.specialisation	Bachelor Artificial Intelligence	en_US
dc.thesis.studyprogramme	Artificial Intelligence	en_US
dc.thesis.type	Bachelor	en_US
dc.title	Organizing Flickr30k Using Text Clustering	en_US

Files

Now showing 1 - 1 of 1