Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3397271.3401121acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Learning with Weak Supervision for Email Intent Detection

Published: 25 July 2020 Publication History

Abstract

Email remains one of the most frequently used means of online communication. People spend significant amount of time every day on emails to exchange information, manage tasks and schedule events. Previous work has studied different ways for improving email productivity by prioritizing emails, suggesting automatic replies or identifying intents to recommend appropriate actions. The problem has been mostly posed as a supervised learning problem where models of different complexities were proposed to classify an email message into a predefined taxonomy of intents or classes. The need for labeled data has always been one of the largest bottlenecks in training supervised models. This is especially the case for many real-world tasks, such as email intent classification, where large scale annotated examples are either hard to acquire or unavailable due to privacy or data access constraints. Email users often take actions in response to intents expressed in an email (e.g., setting up a meeting in response to an email with a scheduling request). Such actions can be inferred from user interaction logs. In this paper, we propose to leverage user actions as a source of weak supervision, in addition to a limited set of annotated examples, to detect intents in emails. We develop an end-to-end robust deep neural network model for email intent identification that leverages both clean annotated data and noisy weak supervision along with a self-paced learning mechanism. Extensive experiments on three different intent detection tasks show that our approach can effectively leverage the weakly supervised data to improve intent detection in emails.

References

[1]
Email statistics report. The Radicati Group, INC., 2015.
[2]
Eugene Agichtein, Eric Brill, and Susan Dumais. Improving web search ranking by incorporating user behavior information. In SIGIR, 2006.
[3]
Qingyao Ai, Susan T Dumais, Nick Craswell, and Dan Liebling. Characterizing email search using large-scale behavioral logs and surveys. In WWW, 2017.
[4]
Hosein Azarbonyad, Robert Sim, and Ryen W White. Domain adaptation for commitment detection in email. In WSDM, 2019.
[5]
Yoshua Bengio, Jérôme Louradour, Ronan Collobert, and Jason Weston. Curriculum learning. In ICML, 2009.
[6]
Paul N. Bennett and Jaime Carbonell. Detecting action-items in e-mail. In SIGIR, 2005.
[7]
Paul N Bennett and Jaime G Carbonell. Detecting action items in email. 2005.
[8]
Moses Charikar, Jacob Steinhardt, and Gregory Valiant. Learning from untrusted data. In SIGACT, 2017.
[9]
Michael Chui, James Manyika, Jacques Bughin, Richard Dobbs, Charles Roxburgh, Hugo Sarrazin, Georey Sands, and Magdalena Westergren. The social economy: Unlocking value and productivity through social technologies. McKinsey Global Institute., 2012.
[10]
Jacob Cohen. A coefficient of agreement for nominal scales. Educational and psychological measurement, 20(1):37--46, 1960.
[11]
William W. Cohen, Vitor R. Carvalho, and Tom M. Mitchell. Learning to classify email into speech acts. In In Proceedings of Empirical Methods in Natural Language Processing, 2004.
[12]
Laura A. Dabbish, Robert E. Kraut, Susan Fussell, and Sara Kiesler. Understanding email use: Predicting action on a message. In CHI. ACM, 2005.
[13]
Laura A Dabbish, Robert E Kraut, Susan Fussell, and Sara Kiesler. Understanding email use: predicting action on a message. In CHI, 2005.
[14]
Mostafa Dehghani, Hamed Zamani, Aliaksei Severyn, Jaap Kamps, and W Bruce Croft. Neural ranking models with weak supervision. In SIGIR, 2017.
[15]
Benoît Frénay and Michel Verleysen. Classification in the presence of label noise: a survey. IEEE transactions on neural networks and learning systems, 25(5):845--869, 2013.
[16]
Liang Ge, Jing Gao, Xiaoyi Li, and Aidong Zhang. Multi-source deep learning for information trustworthiness estimation. In KDD, 2013.
[17]
Alex Graves and Jürgen Schmidhuber. Framewise phoneme classification with bidirectional lstm and other neural network architectures. Neural networks, 18(5--6):602--610, 2005.
[18]
Ahmed Hassan, Rosie Jones, and Kristina Lisa Klinkner. Beyond dcg: user behavior as a predictor of a successful search. In WSDM, 2010.
[19]
Dan Hendrycks, Mantas Mazeika, Duncan Wilson, and Kevin Gimpel. Using trusted data to train deep networks on labels corrupted by severe noise. In NeurIPS, 2018.
[20]
Eric Horvitz. Principles of mixed-initiative user interfaces. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI '99, pages 159--166, New York, NY, USA, 1999. ACM.
[21]
Eric Horvitz, Andy Jacobs, and David Hovel. Attention-sensitive alerting. In UAI, 1999.
[22]
Meng Jiang, Peng Cui, Rui Liu, Qiang Yang, Fei Wang, Wenwu Zhu, and Shiqiang Yang. Social contextual recommendation. In CIKM, 2012.
[23]
Thorsten Joachims, Laura Granka, Bing Pan, Helene Hembrooke, Filip Radlinski, and Geri Gay. Evaluating the accuracy of implicit feedback from clicks and query reformulations in web search. TOIS, 25(2):7, 2007.
[24]
Anjuli Kannan, Karol Kurach, Sujith Ravi, Tobias Kaufmann, Andrew Tomkins, Balint Miklos, Greg Corrado, László Lukács, Marina Ganea, Peter Young, et al. Smart reply: Automated response suggestion for email. arXiv preprint arXiv:1606.04870, 2016.
[25]
Bryan Klimt and Yiming Yang. The enron corpus: A new dataset for email classification research. In ECML, 2004.
[26]
Farshad Kooti, Luca Maria Aiello, Mihajlo Grbovic, Kristina Lerman, and Amin Mantrach. Evolution of conversations in the age of email overload. In WWW, 2015.
[27]
M Pawan Kumar, Benjamin Packer, and Daphne Koller. Self-paced learning for latent variable models. In NeurIPS, 2010.
[28]
Andrew Lampert, Robert Dale, and Cecile Paris. Detecting emails containing requests for action. In HLT, 2010.
[29]
Andrew Lampert, Robert Dale, and Cecile Paris. Detecting emails containing requests for action. In NAACL, 2010.
[30]
Yuncheng Li, Jianchao Yang, Yale Song, Liangliang Cao, Jiebo Luo, and Li-Jia Li. Learning from noisy labels with distillation. In ICCV, 2017.
[31]
Chu-Cheng Lin, Dongyeop Kang, Michael Gamon, and Patrick Pantel. Actionable email intent modeling with reparametrized rnns. In AAAI, 2018.
[32]
Cheng Luo, Yukun Zheng, Jiaxin Mao, Yiqun Liu, Min Zhang, and Shaoping Ma. Training deep ranking model with weak relevance labels. In Zi Huang, Xiaokui Xiao, and Xin Cao, editors, Databases Theory and Applications, pages 205--216, Cham, 2017. Springer International Publishing.
[33]
Joel Mackenzie, Kshitiz Gupta, Fang Qiao, Ahmed Hassan Awadallah, and Milad Shokouhi. Exploring user behavior in email re-finding tasks. In WWW, 2019.
[34]
Yu Meng, Jiaming Shen, Chao Zhang, and Jiawei Han. Weakly-supervised hierarchical text classification. In AAAI, 2019.
[35]
Bhaskar Mitra, Fernando Diaz, and Nick Craswell. Learning to match using local and distributed representations of text for web search. In WWW, 2017.
[36]
Nagarajan Natarajan, Inderjit S Dhillon, Pradeep K Ravikumar, and Ambuj Tewari. Learning with noisy labels. In NeurIPS, 2013.
[37]
David F Nettleton, Albert Orriols-Puig, and Albert Fornells. A study of the effect of different types of noise on the precision of supervised learning techniques. Artificial intelligence review, 33(4):275--306, 2010.
[38]
Douglas Oard, William Webber, David Kirsch, and Sergey Golitsynskiy. Avocado research email collection. Philadelphia: Linguistic Data Consortium, 2015.
[39]
Wanli Ouyang, Xiao Chu, and Xiaogang Wang. Multi-source deep learning for human pose estimation. In CVPR, 2014.
[40]
Giorgio Patrini, Alessandro Rozza, Aditya Krishna Menon, Richard Nock, and Lizhen Qu. Making deep neural networks robust to label noise: A loss correction approach. In CVPR, 2017.
[41]
Jeffrey Pennington, Richard Socher, and Christopher Manning. Glove: Global vectors for word representation. In EMNLP, 2014.
[42]
Alexander Ratner, Stephen H Bach, Henry Ehrenberg, Jason Fries, Sen Wu, and Christopher Ré. Snorkel: Rapid training data creation with weak supervision. VLDB.
[43]
Alexander Ratner, Braden Hancock, Jared Dunnmon, Frederic Sala, Shreyash Pandey, and Christopher Ré. Training complex models with multi-task weak supervision. arXiv preprint arXiv:1810.02840, 2018.
[44]
Scott Reed, Honglak Lee, Dragomir Anguelov, Christian Szegedy, Dumitru Erhan, and Andrew Rabinovich. Training deep neural networks on noisy labels with bootstrapping. arXiv preprint arXiv:1412.6596, 2014.
[45]
Mengye Ren, Wenyuan Zeng, Bin Yang, and Raquel Urtasun. Learning to reweight examples for robust deep learning. arXiv preprint arXiv:1803.09050, 2018.
[46]
Maya Sappelli, Gabriella Pasi, Suzan Verberne, Maaike de Boer, and Wessel Kraaij. Assessing e-mail intent and tasks in e-mail messages. Information Sciences, 358:1--17, 2016.
[47]
Bahareh Sarrafzadeh, Ahmed Hassan Awadallah, Christopher H Lin, Chia-Jung Lee, Milad Shokouhi, and Susan T Dumais. Characterizing and predicting email deferral behavior. In WSDM, 2019.
[48]
Sainbayar Sukhbaatar, Joan Bruna, Manohar Paluri, Lubomir Bourdev, and Rob Fergus. Training convolutional networks with noisy labels. arXiv preprint arXiv:1406.2080, 2014.
[49]
Paroma Varma, Frederic Sala, Ann He, Alexander Ratner, and Christopher Ré. Learning dependency structures for weak supervision models. ICML, 2019.
[50]
Wei Wang, Saghar Hosseini, Ahmed Hassan Awadallah, Paul N. Bennett, and Chris Quirk. Context-aware intent identification in email conversations. In SIGIR, 2019.
[51]
Wei Wang, Saghar Hosseini, Ahmed Hassan Awadallah, Paul N Bennett, and Chris Quirk. Context-aware intent identification in email conversations. In SIGIR, 2019.
[52]
Liu Yang, Susan T Dumais, Paul N Bennett, and Ahmed Hassan Awadallah. Characterizing and predicting enterprise email reply behavior. In SIGIR, 2017.
[53]
Xiao Yang, Ahmed Hassan Awadallah, Madian Khabsa, Wei Wang, and Miaosen Wang. Characterizing and supporting question answering in human-to-human communication. In SIGIR, 2018.
[54]
Matthew D Zeiler. Adadelta: an adaptive learning rate method. arXiv preprint arXiv:1212.5701, 2012.
[55]
Chiyuan Zhang, Samy Bengio, Moritz Hardt, Benjamin Recht, and Oriol Vinyals. Understanding deep learning requires rethinking generalization, 2016.

Cited By

View all
  • (2024)A Weak Supervision-Based Approach to Improve Chatbots for Code RepositoriesProceedings of the ACM on Software Engineering10.1145/36608121:FSE(2378-2401)Online publication date: 12-Jul-2024
  • (2024)Better Graph Embeddings for Enterprise GraphsProceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD)10.1145/3632410.3632412(368-374)Online publication date: 4-Jan-2024
  • (2024)LabelAId: Just-in-time AI Interventions for Improving Human Labeling Quality and Domain Knowledge in Crowdsourcing SystemsProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642089(1-21)Online publication date: 11-May-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '20: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
July 2020
2548 pages
ISBN:9781450380164
DOI:10.1145/3397271
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 July 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. email intent detection
  2. natural language understanding
  3. weak supervision

Qualifiers

  • Research-article

Conference

SIGIR '20
Sponsor:

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)29
  • Downloads (Last 6 weeks)3
Reflects downloads up to 26 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)A Weak Supervision-Based Approach to Improve Chatbots for Code RepositoriesProceedings of the ACM on Software Engineering10.1145/36608121:FSE(2378-2401)Online publication date: 12-Jul-2024
  • (2024)Better Graph Embeddings for Enterprise GraphsProceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD)10.1145/3632410.3632412(368-374)Online publication date: 4-Jan-2024
  • (2024)LabelAId: Just-in-time AI Interventions for Improving Human Labeling Quality and Domain Knowledge in Crowdsourcing SystemsProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642089(1-21)Online publication date: 11-May-2024
  • (2024)Deep Learning based Predictive Analytics for Employees Turnover with Performance Prediction in Banking Sector2024 3rd International Conference on Sentiment Analysis and Deep Learning (ICSADL)10.1109/ICSADL61749.2024.00057(312-317)Online publication date: 13-Mar-2024
  • (2024)Deep Learning based Predictive Analytics for Employees Turnover with Performance Prediction in Banking Sector2024 International Conference on Inventive Computation Technologies (ICICT)10.1109/ICICT60155.2024.10544552(933-938)Online publication date: 24-Apr-2024
  • (2024)Detection of Objectionable Song Lyrics Using Weakly Supervised Learning and Natural Language Processing TechniquesProcedia Computer Science10.1016/j.procs.2024.04.183235(1929-1942)Online publication date: 2024
  • (2024)Transformer models for mining intents and predicting activities from emails in knowledge-intensive processesEngineering Applications of Artificial Intelligence10.1016/j.engappai.2023.107450128(107450)Online publication date: Feb-2024
  • (2023)Building a Multimodal Classifier of Email Behavior: Towards a Social Network Understanding of Organizational CommunicationInformation10.3390/info1412066114:12(661)Online publication date: 14-Dec-2023
  • (2023)Identification and Generation of Actions Using Pre-trained Language ModelsWeb Information Systems Engineering – WISE 202310.1007/978-981-99-7254-8_52(674-683)Online publication date: 21-Oct-2023
  • (2023)Transformer Models for Activity Mining in Knowledge-Intensive ProcessesBusiness Process Management Workshops10.1007/978-3-031-25383-6_2(13-24)Online publication date: 9-Feb-2023
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media