Topic Modeling with Word2Vec based Noun Expansion for Dark Web Marketplace Analysis
dc.contributor.advisor | Dr. Grootjen, F.A. | |
dc.contributor.advisor | Dr. de Boer, M.H.T.(external) | |
dc.contributor.advisor | Dr. Joosten, B.(external) | |
dc.contributor.author | Vogelzang, B. | |
dc.date.issued | 2019-10-11 | |
dc.description.abstract | A novel approach for expanding documents is proposed to improve topic modeling on short text. The enrichment is based on expanding noun words with information from custom (e.g. domain-speci c) and pretrained Word2Vec models. The quality of the di erent conditions: original, custom and pretrained, are evaluated with manual analysis of the created topics and with the classi cation performance of a Suport Vector Machine trained on the output of an LDA system. Manual analysis did not show a striking improvement of the created topics with the enriched texts, compared to the original text. The performance of the prediction models show a improved performance, only when enriched with information from the custom Word2Vec models. However, the extent of the improvement is dependent on the text domain. | en_US |
dc.embargo.lift | 10000-01-01 | |
dc.embargo.type | Permanent embargo | en_US |
dc.identifier.uri | https://theses.ubn.ru.nl/handle/123456789/10697 | |
dc.language.iso | en | en_US |
dc.thesis.faculty | Faculteit der Sociale Wetenschappen | en_US |
dc.thesis.specialisation | Master Artificial Intelligence | en_US |
dc.thesis.studyprogramme | Artificial Intelligence | en_US |
dc.thesis.type | Master | en_US |
dc.title | Topic Modeling with Word2Vec based Noun Expansion for Dark Web Marketplace Analysis | en_US |
Files
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- S4368487_VogelzangB 2018.pdf
- Size:
- 1.06 MB
- Format:
- Adobe Portable Document Format