We propose a regression-based automatic evaluation method that evaluates the utterances generated by chat-oriented dialogue systems based on their similarities to a large set of reference utterances (multi-references).
Sugiyama, H., et al. Automatic Evaluation of Chat-Oriented Dialogue Systems Using Large-Scale Multi-references. In: 8th International Workshop on Spoken Dialog Systems (IWSDS), Lecture Notes in Electrical Engineering, 2019.
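As a rough sketch of this regression idea: featurize an utterance by its similarity to each reference, then fit a regressor to human quality scores. The difflib surface-overlap similarity, the max/mean/min feature summary, and scikit-learn's Ridge below are illustrative assumptions, not the paper's actual components.

from difflib import SequenceMatcher

from sklearn.linear_model import Ridge


def similarity_features(utterance, references):
    # Similarity of the utterance to every reference; surface overlap
    # here, though an embedding- or n-gram-based similarity works too.
    sims = sorted(
        SequenceMatcher(None, utterance, ref).ratio() for ref in references
    )
    # Summarize the similarity distribution as max, mean, min.
    return [sims[-1], sum(sims) / len(sims), sims[0]]


# Hypothetical training data: system utterances with human quality scores.
train_utts = ["i like dogs too", "what do you mean", "hello hello hello"]
references = ["i love dogs as well", "me too, dogs are great", "sorry?"]
human_scores = [4.5, 2.0, 1.0]

X = [similarity_features(u, references) for u in train_utts]
model = Ridge().fit(X, human_scores)

# Score a new system utterance.
print(model.predict([similarity_features("i really like dogs", references)]))

Learning the mapping from similarity features to scores lets the metric weight close and merely related references differently, rather than relying on a single fixed threshold.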
The evaluation of chat-oriented dialogue systems has typically been accomplished through a combination of automated metrics and human evaluation.
The automatic evaluation of chat-oriented dialogue systems remains an open problem. Most studies have evaluated them by hand, but this approach requires a huge amount of human time and cost.
The aim of this paper is to mitigate the shortcomings of automatic evaluation of open-domain dialog systems through multi-reference evaluation. A series of experiments shows that using multiple references improves the correlation between several automatic metrics and human judgment.
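The mechanism is easy to reproduce with an off-the-shelf metric. The sketch below uses NLTK's BLEU (one plausible choice; the paper evaluates several metrics) to show how a reasonable response that diverges from a single reference can still be credited once more references are available.

from nltk.translate.bleu_score import SmoothingFunction, sentence_bleu

smooth = SmoothingFunction().method1
hypothesis = "i love dogs as well".split()

single_ref = ["me too , dogs are great".split()]
multi_refs = single_ref + ["i love dogs as well !".split(),
                           "yeah , i really like dogs".split()]

# With one reference a perfectly good reply can score near zero;
# with multiple references the best-matching reference is credited.
print(sentence_bleu(single_ref, hypothesis, smoothing_function=smooth))
print(sentence_bleu(multi_refs, hypothesis, smoothing_function=smooth))

The jump from the first score to the second, on a reply a human would accept, is the effect behind the improved metric-human correlation.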
More recently, a BERT-based evaluation metric called DEB was proposed, pretrained on 727M Reddit conversations and then finetuned on the authors' own evaluation dataset.
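A DEB-style scorer can be sketched as a BERT sentence-pair classifier that estimates whether a response validly follows its context. The checkpoint name and two-label head below are assumptions for illustration; the classification head here is untrained, whereas DEB's weights come from the Reddit pretraining and finetuning described above.

import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2  # {invalid, valid} response
)


def deb_style_score(context: str, response: str) -> float:
    # Encode (context, response) as a sentence pair and return the
    # probability of the "valid continuation" label. Meaningful only
    # after DEB-style pretraining/finetuning of the head.
    inputs = tokenizer(context, response, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**inputs).logits
    return torch.softmax(logits, dim=-1)[0, 1].item()


print(deb_style_score("How was your weekend?", "Great, I went hiking!"))

Because the model scores the context-response pair directly, it needs no reference responses at evaluation time, which distinguishes it from the multi-reference approaches above.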