Visualizing Breast Cancer Data with t-SNE
One in eight women will get breast cancer in her lifetime and in 2008 it has caused 458.503 deaths among the world . Despite that technology has made considerable improvements in the last decades, there is still room for more advances. A technique that possibly can contribute to this field is t-SNE . The aim of this thesis is to investigate whether t-SNE is able to present the breast cancer data in an interpretable way and possibly improves the classification performances. We employ two approaches to explore the applicability of t-SNE. In the first approach we compare the visualizations and in the second approach the classification performances are compared. We found that classification on the original data per- formed significantly better than on t-SNE data. This suggests that t-SNE is not applicable to the breast cancer data set.
Faculteit der Sociale Wetenschappen