Evaluating hallucinations and repair in open-domain dialogue systems.

Keywords
Loading...
Thumbnail Image
Issue Date
2023-01-28
Language
en
Document type
Journal Title
Journal ISSN
Volume Title
Publisher
Title
ISSN
Volume
Issue
Startpage
Endpage
DOI
Abstract
This study investigates the repair strategies employed by large language models, during conversations with human interactors and the influence these strategies have on the human interactor’s perception. A corpus of 1123 conversations was collected and analysed, as well as a survey of 14 respondents. The results indicate that the chatbot is limited in its ability to resolve conversational errors and that hallucinations had no adverse influence on user experience. This research has implications for the development of open-domain dialogue systems and conversational agents in the form of evaluation metrics that can be used in order to create a realistic understanding of the capabilities of this technology.
Description
Citation
Faculty
Faculteit der Letteren