Research article · Open access
DOI: 10.1145/3316782.3316792

Data generation approaches for topic classification in multilingual spoken dialog systems

Published: 05 June 2019

Abstract

Designers of spoken dialog systems (SDS) often face the problem of extending or adapting a system to multiple languages. This requires building modules specifically for each new language, which is a time-consuming process. In this paper, we propose two methods to reduce the time needed to extend an SDS to other languages. Our methods target the topic classification and semantic tagging tasks, and we evaluate their effectiveness on topic classification for three languages: English, Spanish, and French.

Published In

PETRA '19: Proceedings of the 12th ACM International Conference on PErvasive Technologies Related to Assistive Environments
June 2019
655 pages
ISBN: 9781450362320
DOI: 10.1145/3316782
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. bilingual datasets
  2. neural networks
  3. spoken dialog systems
  4. topic classification

Qualifiers

  • Research-article

Conference

PETRA '19

Article Metrics

  • Downloads (last 12 months): 47
  • Downloads (last 6 weeks): 10

Reflects downloads up to 27 Nov 2024

Cited By

  • (2022) A Multilingual Neural Coaching Model with Enhanced Long-term Dialogue Structure. ACM Transactions on Interactive Intelligent Systems 12:2 (1-47). DOI: 10.1145/3487066. Online publication date: 12-Jul-2022
  • (2021) Machine Learning and Natural Language Processing in Domain Classification of Scientific Knowledge Objects: A Review. Advances in Information and Communication (773-784). DOI: 10.1007/978-3-030-73103-8_55. Online publication date: 16-Apr-2021
  • (2020) Transfer learning in hierarchical dialogue topic classification with neural networks. 2020 International Joint Conference on Neural Networks (IJCNN) (1-8). DOI: 10.1109/IJCNN48605.2020.9206680. Online publication date: Jul-2020
  • (2019) A Dialogue-Act Taxonomy for a Virtual Coach Designed to Improve the Life of Elderly. Multimodal Technologies and Interaction 3:3 (52). DOI: 10.3390/mti3030052. Online publication date: 11-Jul-2019
