Methods For Automatically Generating a Legal Thesaurus

Vos, Hugo P. de

Files

Vos, de H.P. s.4193695-Rema thesis CLS 2017.pdf (843.4 KB)

Authors

Vos, Hugo P. de

Issue Date

2017-08-31

Language

en

URI

http://theses.ubn.ru.nl/handle/123456789/5027

Abstract

Automatic thesaurus generation is desirable because a thesaurus is a useful tool in NLP, but building one manually is expensive and time-consuming. In this thesis, thesaurus generation is divided into two parts: term extraction and relation extraction. Term extraction is the process of automatically finding candidate terms for a legal thesaurus; relation extraction is the process of determining which terms are hypernyms of which other terms. For term extraction, three termhood measures are used: log-likelihood, Kullback-Leibler divergence, and the measure assigned by the TExSIS tool. For relation extraction, different classifiers are trained to decide whether two terms stand in a hypernym relation. The thesis concludes that no system could be built that autonomously constructs a thesaurus, and that in the short term it is better to aim for a system that assists humans in making one.
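
To illustrate the kind of termhood measure the abstract names, here is a minimal sketch of a Kullback-Leibler-style score that compares a term's relative frequency in a domain corpus with its relative frequency in a general reference corpus. This is one common formulation and not necessarily the one used in the thesis; the function name `kl_termhood`, the add-one smoothing, and the whitespace-tokenised input are all assumptions made for illustration.

```python
import math
from collections import Counter


def kl_termhood(domain_tokens, reference_tokens, smoothing=1.0):
    """Score each domain term by its pointwise KL contribution.

    score(t) = p_domain(t) * log(p_domain(t) / p_reference(t))

    Hypothetical helper: the smoothing scheme is an assumption,
    chosen so that terms absent from the reference corpus do not
    cause a division by zero.
    """
    domain = Counter(domain_tokens)
    reference = Counter(reference_tokens)
    vocab = set(domain) | set(reference)

    # Additive smoothing over the joint vocabulary of both corpora.
    domain_total = sum(domain.values()) + smoothing * len(vocab)
    reference_total = sum(reference.values()) + smoothing * len(vocab)

    scores = {}
    for term in domain:
        p_domain = (domain[term] + smoothing) / domain_total
        p_reference = (reference[term] + smoothing) / reference_total
        scores[term] = p_domain * math.log(p_domain / p_reference)
    return scores
```

Terms that are frequent in the domain corpus but rare in the reference corpus receive high scores, while words that are about equally common in both (such as function words) score close to zero. Ranking candidates by this score gives a list that a human thesaurus editor could review, in line with the assistive use the abstract recommends.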

Supervisor

Hendrickx, I.H.E.

Kunneman, F.A.

Faculty

Faculteit der Letteren

Programme

Researchmasters

Specialisation

Researchmaster Language and Communication

Collections

Faculteit der Letteren
