Clustering patients based on their HPO terms and corresponding graphs by use of different semantic similarity measures and graph similarity measures

dc.contributor.advisorDingemans, A.
dc.contributor.advisorHinne, M.
dc.contributor.authorSimmes, Nynke
dc.date.issued2022-01-27
dc.description.abstractFor many patients with intellectual disability (ID), no diagnosis is found after genetic testing. However, sometimes a mutation is found of which the effect is yet unknown: a variant of unknown significance (VUS). Classification or clustering of patients’ HPO terms i.e. phenotypic abnormalities and their corresponding graphs, might be very useful for therapies or recognition of (new) genetic disorders. In this study different semantic and graph similarity measures are applied to the HPO terms and their corresponding graphs of patients with genetic disorders. The measures examined are Resnik, Lin, GED and MCS. The resulting similarity matrices are used as input for the spectral clustering algorithm. Finally, among others, the adjusted rand index (ARI), a measure related to accuracy that can be used to assess clusters, is used to analyze which (combination of) measure(s) leads to the best clustering of patients such that it corresponds most to the actual disorders. We find that semantic similarity measures individually perform better than graph similarity measures. Also, combinations of measures or using evenly distributed data do not seem to lead to better results. A similarly improved method may in the future be useful in solving the VUS.
dc.identifier.urihttps://theses.ubn.ru.nl/handle/123456789/15995
dc.language.isoen
dc.thesis.facultyFaculteit der Sociale Wetenschappen
dc.thesis.specialisationspecialisations::Faculteit der Sociale Wetenschappen::Artificial Intelligence::Bachelor Artificial Intelligence
dc.thesis.studyprogrammestudyprogrammes::Faculteit der Sociale Wetenschappen::Artificial Intelligence
dc.thesis.typeBachelor
dc.titleClustering patients based on their HPO terms and corresponding graphs by use of different semantic similarity measures and graph similarity measures
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Simmes, N. s-1019900-BSc-Thesis-2022.pdf
Size:
772.36 KB
Format:
Adobe Portable Document Format