Evaluating hallucinations and repair in open-domain dialogue systems.

Wouw, I. van de

Evaluating hallucinations and repair in open-domain dialogue systems.

Files

Wouw, Ino van de 1021920-BaThesis.pdf (898.14 KB)

Authors

Wouw, I. van de

Issue Date

2023-01-28

Language

en

URI

https://theses.ubn.ru.nl/handle/123456789/15463

Abstract

This study investigates the repair strategies employed by large language models, during conversations with human interactors and the influence these strategies have on the human interactor’s perception. A corpus of 1123 conversations was collected and analysed, as well as a survey of 14 respondents. The results indicate that the chatbot is limited in its ability to resolve conversational errors and that hallucinations had no adverse influence on user experience. This research has implications for the development of open-domain dialogue systems and conversational agents in the form of evaluation metrics that can be used in order to create a realistic understanding of the capabilities of this technology.

Supervisor

Liesenfeld, A.M.

Thesing, C.

Faculty

Faculteit der Letteren