Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3269206.3272029acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Automatic Conversational Helpdesk Solution using Seq2Seq and Slot-filling Models

Published: 17 October 2018 Publication History

Abstract

Helpdesk is a key component of any large IT organization, where users can log a ticket about any issue they face related to IT infrastructure, administrative services, human resource services, etc. Normally, users have to assign appropriate set of labels to a ticket so that it could be routed to right domain expert who can help resolve the issue. In practice, the number of labels are very large and organized in form of a tree. It is non-trivial to describe the issue completely and attach appropriate labels unless one knows the cause of the problem and the related labels. Sometimes domain experts discuss the issue with the users and change the ticket labels accordingly, without modifying the ticket description. This results in inconsistent and badly labeled data, making it hard for supervised algorithms to learn from. In this paper, we propose a novel approach of creating a conversational helpdesk system, which will ask relevant questions to the user, for identification of the right category and will then raise a ticket on users' behalf. We use attention based seq2seq model to assign the hierarchical categories to tickets. We use a slot filling model to help us decide what questions to ask to the user, if the top-k model predictions are not consistent. We also present a novel approach to generate training data for the slot filling model automatically based on attention in the hierarchical classification model. We demonstrate via a simulated user that the proposed approach can give us a significant gain in accuracy on ticket-data without asking too many questions to users. Finally, we also show that our seq2seq model is as versatile as other approaches on publicly available datasets, as state of the art approaches.

References

[1]
Mucahit Altintas and Cuneyd Tantug. 2014. Machine Learning Based Ticket Classification in Issue Tracking Systems. In Proceeding of the International Conference on Artificial Intelligence and Computer Science (AICS).
[2]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural Machine Translation by Jointly Learning to Align and Translate. CoRR abs/1409.0473 (2014).
[3]
Jayme Garcia Arnal Barbedo and Amauri Lopes. 2007. Automatic Genre Classification of Musical Signals. EURASIP Journal on Advances in Signal Processing 2007, 1 (Jan. 2007).
[4]
Zafer Barutcuoglu and Christopher DeCoro. 2006. Hierarchical Shape Classification Using Bayesian Aggregation. In Proceedings of the IEEE International Conference on Shape Modeling and Applications 2006 (SMI '06).
[5]
Daniel Beneker and Carsten Gips. 2017. Using Clustering for Categorization of Support Tickets. In LWDA.
[6]
Juan José Burred and Alexander Lerch. 2003. A Hierarchical Approach to Automatic Musical Genre Classification. In Proceedings of the 6th International Conference on Digital Audio Effects (DAFx-03).
[7]
Anveshi Charuvaka and Huzefa Rangwala. 2015. HierCost: Improving Large Scale Hierarchical Classification with Cost Sensitive Learning. In Proceedings of the 2015th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I (ECMLPKDD'15). Springer, Switzerland.
[8]
Kyunghyun Cho, Bart van Merrienboer, et al. 2014. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. (2014).
[9]
Stephen D'Alessio et al. 2000. The Effect of Using Hierarchical Classifiers in Text Categorization. In Content-Based Multimedia Information Access - Volume 1 (RIAO '00).
[10]
Ofer Dekel, Joseph Keshet, and Yoram Singer. 2004. Large Margin Hierarchical Classification. In Proceedings of the Twenty-first International Conference on Machine Learning (ICML '04). ACM.
[11]
Y. Diao, H. Jamjoom, and D. Loewenstern. 2009. Rule-Based Problem Classification in IT Service Management. In 2009 IEEE International Conference on Cloud Computing.
[12]
Alex A. Freitas. 2007. A Tutorial on Hierarchical Classification with Applications in Bioinformatics. In In: D. Taniar (Ed.) Research and Trends in Data Mining Technologies and Applications, Idea Group, 2007. 175--208.
[13]
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long Short-Term Memory. Neural Comput. (1997).
[14]
Diederik P. Kingma and Jimmy Ba. 2014. Adam: A Method for Stochastic Optimization. CoRR (2014).
[15]
Daphne Koller and Mehran Sahami. 1997. Hierarchically Classifying Documents Using Very Few Words. In Proceedings of the Fourteenth International Conference on Machine Learning (ICML '97).
[16]
Kamran Kowsari, Donald E Brown, et al. 2017. HDLTex: Hierarchical Deep Learning for Text Classification. In 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA). 364--371.
[17]
Taku Kudo and Yuji Matsumoto. 2001. Chunking with Support Vector Machines. In Proceedings of the Second Meeting of the North American Chapter of the Association for Computational Linguistics on Language Technologies (NAACL '01). Association for Computational Linguistics.
[18]
Jiwei Li, Will Monroe, et al. 2016. Deep Reinforcement Learning for Dialogue Generation. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Austin, Texas.
[19]
Bing Liu and Ian Lane. 2015. Recurrent Neural Network Structured Output Prediction for Spoken Language Understanding.
[20]
Bing Liu and Ian Lane. 2016. Attention-Based Recurrent Neural Network Models for Joint Intent Detection and Slot Filling. CoRR abs/1609.01454 (2016).
[21]
Minh-Thang Luong, Hieu Pham, and Christopher D. Manning. 2015. Effective Approaches to Attention-based Neural Machine Translation. CoRR abs/1508.04025 (2015).
[22]
Senthil Mani, Neelamadhav Gantayat, et al. 2018. Hi, How Can I Help You?: Automating Enterprise IT Support Help Desks. CoRR abs/1711.02012 (2018).
[23]
G. Mesnil, Y. Dauphin, K. Yao, et al. 2015. Using Recurrent Neural Networks for Slot Filling in Spoken Language Understanding. IEEE/ACM Transactions on Audio, Speech, and Language Processing (2015).
[24]
Tomas Mikolov, Ilya Sutskever, et al. 2013. Distributed Representations of Words and Phrases and their Compositionality. In Advances in Neural Information Processing Systems 26.
[25]
Jeffrey Pennington, Richard Socher, and Christopher D. Manning. 2014. GloVe: Global Vectors for Word Representation. In Empirical Methods in Natural Language Processing (EMNLP).
[26]
Christian Raymond and Giuseppe Riccardi. 2007. Generative and discriminative algorithms for spoken language understanding. In INTERSPEECH.
[27]
M. Schuster and K.K. Paliwal. November 1997. Bidirectional Recurrent Neural Networks. Trans. Sig. Proc. (November 1997).
[28]
Iulian V. Serban, Alessandro Sordoni, et al. 2016. Building End-to-end Dialogue Systems Using Generative Hierarchical Neural Network Models. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI'16). AAAI Press.
[29]
Qihong Shao, Yi Chen, et al. 2008. EasyTicket: A Ticket Routing Recommendation Engine for Enterprise Problem Resolution. Proc. VLDB Endow. 1, 2 (Aug. 2008).
[30]
Qihong Shao, Yi Chen, et al. 2008. Efficient Ticket Routing by Resolution Sequence Mining. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '08). ACM.
[31]
Carlos N. Silla and Alex A. Freitas. 2011. A survey of hierarchical classification across different application domains. Data Mining and Knowledge Discovery (2011).
[32]
Carlos N. Silla Jr. and Alex A. Freitas. 2009. A Global-Model Naive Bayes Approach to the Hierarchical Prediction of Protein Functions. In Proceedings of the 2009 Ninth IEEE International Conference on Data Mining (ICDM '09).
[33]
Nitish Srivastava, Geoffrey E Hinton, et al. 2014. Dropout: a simple way to prevent neural networks from overfitting. Journal of Machine Learning Research (2014).
[34]
Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to Sequence Learning with Neural Networks. CoRR abs/1409.3215 (2014).
[35]
Gökhan Tür, Dilek Z. Hakkani-Tür, et al. 2011. Sentence simplification for spoken language understanding. 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2011), 5628--5631.
[36]
Oriol Vinyals and Quoc Le. 2015. A Neural Conversational Model. (06 2015).
[37]
Frank Wilcoxon. 1992. Individual Comparisons by Ranking Methods. Springer New York.
[38]
C. Zeng, T. Li, L. Shwartz, and G. Y. Grabarnik. 2014. Hierarchical multi-label classification over ticket data using contextual loss. In 2014 IEEE Network Operations and Management Symposium (NOMS).
[39]
W. Zhou, L. Tang, et al. 2016. Resolution Recommendation for Event Tickets in Service Management. IEEE Transactions on Network and Service Management 13 (2016).
[40]
Wubai Zhou, Wei Xue, et al. 2017. STAR: A System for Ticket Analysis and Resolution. In KDD.

Cited By

View all
  • (2021)Service chatbotsExpert Systems with Applications: An International Journal10.1016/j.eswa.2021.115461184:COnline publication date: 1-Dec-2021
  • (2020)Automatically Resolve Trouble Tickets with Hybrid NLP2020 IEEE Symposium Series on Computational Intelligence (SSCI)10.1109/SSCI47803.2020.9308327(1334-1340)Online publication date: 1-Dec-2020

Index Terms

  1. Automatic Conversational Helpdesk Solution using Seq2Seq and Slot-filling Models

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CIKM '18: Proceedings of the 27th ACM International Conference on Information and Knowledge Management
      October 2018
      2362 pages
      ISBN:9781450360142
      DOI:10.1145/3269206
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 17 October 2018

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. conversational system
      2. helpdesk
      3. hierarchical ticket classification
      4. sequence to sequence learning
      5. slot filling

      Qualifiers

      • Research-article

      Conference

      CIKM '18
      Sponsor:

      Acceptance Rates

      CIKM '18 Paper Acceptance Rate 147 of 826 submissions, 18%;
      Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

      Upcoming Conference

      CIKM '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)15
      • Downloads (Last 6 weeks)3
      Reflects downloads up to 14 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2021)Service chatbotsExpert Systems with Applications: An International Journal10.1016/j.eswa.2021.115461184:COnline publication date: 1-Dec-2021
      • (2020)Automatically Resolve Trouble Tickets with Hybrid NLP2020 IEEE Symposium Series on Computational Intelligence (SSCI)10.1109/SSCI47803.2020.9308327(1334-1340)Online publication date: 1-Dec-2020

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media