The application of transformer-based language models in WSD
Issue Date
2024-09-24
Language
en
Abstract
Word Sense Disambiguation (WSD) is the process of automatically linking semantically ambiguous information expressed through language to categorizations of senses. WSD is often based on distributional information gathered from word context, represented as vector embeddings. Transformer-based language models provide Contextualized Word Embeddings (CWEs) which enable disambiguation of polysemous and homonymous tokens. In this study, CWEs are produced by BERT for ambiguous tokens appearing in SemCor. Clustering is applied to find groups of similar CWEs. These clusters are mapped to the SemCor annotation. The effects of the entropy of sense inventories and of syntactic classes within them are tested. Additionally, word suggestions produced by RoBERTa are aggregated into ‘R-lists’ which represent each group. These are evaluated for informativity. Entropy has a significant negative effect on accuracy. Syntactic entropy has a positive effect on accuracy, but not within syntactically ambiguous words. R-lists are shown to provide a reasonable degree of informativity.
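The pipeline the abstract describes, clustering contextualized embeddings of an ambiguous word and mapping each cluster to the majority gold sense, can be sketched as follows. This is an illustrative sketch only, not the thesis code: it uses synthetic stand-in vectors instead of real BERT CWEs, invented sense labels for a homonym ("bank"), and scikit-learn's KMeans as one plausible clustering choice. It also computes the Shannon entropy of the gold sense distribution, the quantity whose effect on accuracy the abstract reports.

```python
import math
from collections import Counter

import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)

# Synthetic stand-ins for contextualized word embeddings (CWEs) of "bank":
# two tight, well-separated blobs, one per gold sense annotation.
emb_river = rng.normal(loc=-5.0, scale=0.5, size=(40, 16))
emb_money = rng.normal(loc=5.0, scale=0.5, size=(40, 16))
X = np.vstack([emb_river, emb_money])
gold = ["bank%river"] * 40 + ["bank%money"] * 40

# Cluster the embeddings (k = number of attested senses, here 2).
km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)

# Map each cluster to the majority gold sense among its members,
# then score how often that mapping recovers the annotation.
cluster_to_sense = {
    c: Counter(g for g, l in zip(gold, km.labels_) if l == c).most_common(1)[0][0]
    for c in set(km.labels_)
}
pred = [cluster_to_sense[l] for l in km.labels_]
accuracy = sum(p == g for p, g in zip(pred, gold)) / len(gold)

def sense_entropy(senses):
    """Shannon entropy (bits) of a sense distribution."""
    counts = Counter(senses)
    n = len(senses)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

print(accuracy)             # perfectly separable synthetic data -> 1.0
print(sense_entropy(gold))  # uniform two-sense inventory -> 1.0 bit
```

On real SemCor data the embeddings would come from a BERT forward pass over each sentence (and senses are rarely balanced), but the cluster-to-sense mapping and the entropy measure work the same way.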
Faculty
Faculteit der Letteren