Evaluating hallucinations and repair in open-domain dialogue systems.
Keywords
Loading...
Authors
Issue Date
2023-01-28
Language
en
Document type
Journal Title
Journal ISSN
Volume Title
Publisher
Title
ISSN
Volume
Issue
Startpage
Endpage
DOI
Abstract
This study investigates the repair strategies employed by large language models,
during conversations with human interactors and the influence these strategies have on the
human interactor’s perception. A corpus of 1123 conversations was collected and analysed, as
well as a survey of 14 respondents. The results indicate that the chatbot is limited in its ability
to resolve conversational errors and that hallucinations had no adverse influence on user
experience. This research has implications for the development of open-domain dialogue
systems and conversational agents in the form of evaluation metrics that can be used in order
to create a realistic understanding of the capabilities of this technology.
Description
Citation
Supervisor
Faculty
Faculteit der Letteren