research-article

Learning with Weak Supervision for Email Intent Detection

Authors:

Subhabrata Mukherjee,

Ahmed Hassan Awadallah,

Milad Shokouhi,

Susan DumaisAuthors Info & Claims

SIGIR '20: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pages 1051 - 1060

https://doi.org/10.1145/3397271.3401121

Published: 25 July 2020 Publication History

Abstract

Email remains one of the most frequently used means of online communication. People spend significant amount of time every day on emails to exchange information, manage tasks and schedule events. Previous work has studied different ways for improving email productivity by prioritizing emails, suggesting automatic replies or identifying intents to recommend appropriate actions. The problem has been mostly posed as a supervised learning problem where models of different complexities were proposed to classify an email message into a predefined taxonomy of intents or classes. The need for labeled data has always been one of the largest bottlenecks in training supervised models. This is especially the case for many real-world tasks, such as email intent classification, where large scale annotated examples are either hard to acquire or unavailable due to privacy or data access constraints. Email users often take actions in response to intents expressed in an email (e.g., setting up a meeting in response to an email with a scheduling request). Such actions can be inferred from user interaction logs. In this paper, we propose to leverage user actions as a source of weak supervision, in addition to a limited set of annotated examples, to detect intents in emails. We develop an end-to-end robust deep neural network model for email intent identification that leverages both clean annotated data and noisy weak supervision along with a self-paced learning mechanism. Extensive experiments on three different intent detection tasks show that our approach can effectively leverage the weakly supervised data to improve intent detection in emails.

References

[1]

Email statistics report. The Radicati Group, INC., 2015.

[2]

Eugene Agichtein, Eric Brill, and Susan Dumais. Improving web search ranking by incorporating user behavior information. In SIGIR, 2006.

Digital Library

[3]

Qingyao Ai, Susan T Dumais, Nick Craswell, and Dan Liebling. Characterizing email search using large-scale behavioral logs and surveys. In WWW, 2017.

[4]

Hosein Azarbonyad, Robert Sim, and Ryen W White. Domain adaptation for commitment detection in email. In WSDM, 2019.

Digital Library

[5]

Yoshua Bengio, Jérôme Louradour, Ronan Collobert, and Jason Weston. Curriculum learning. In ICML, 2009.

Digital Library

[6]

Paul N. Bennett and Jaime Carbonell. Detecting action-items in e-mail. In SIGIR, 2005.

[7]

Paul N Bennett and Jaime G Carbonell. Detecting action items in email. 2005.

[8]

Moses Charikar, Jacob Steinhardt, and Gregory Valiant. Learning from untrusted data. In SIGACT, 2017.

Digital Library

[9]

Michael Chui, James Manyika, Jacques Bughin, Richard Dobbs, Charles Roxburgh, Hugo Sarrazin, Georey Sands, and Magdalena Westergren. The social economy: Unlocking value and productivity through social technologies. McKinsey Global Institute., 2012.

[10]

Jacob Cohen. A coefficient of agreement for nominal scales. Educational and psychological measurement, 20(1):37--46, 1960.

[11]

William W. Cohen, Vitor R. Carvalho, and Tom M. Mitchell. Learning to classify email into speech acts. In In Proceedings of Empirical Methods in Natural Language Processing, 2004.

[12]

Laura A. Dabbish, Robert E. Kraut, Susan Fussell, and Sara Kiesler. Understanding email use: Predicting action on a message. In CHI. ACM, 2005.

Digital Library

[13]

Laura A Dabbish, Robert E Kraut, Susan Fussell, and Sara Kiesler. Understanding email use: predicting action on a message. In CHI, 2005.

Digital Library

[14]

Mostafa Dehghani, Hamed Zamani, Aliaksei Severyn, Jaap Kamps, and W Bruce Croft. Neural ranking models with weak supervision. In SIGIR, 2017.

Digital Library

[15]

Benoît Frénay and Michel Verleysen. Classification in the presence of label noise: a survey. IEEE transactions on neural networks and learning systems, 25(5):845--869, 2013.

[16]

Liang Ge, Jing Gao, Xiaoyi Li, and Aidong Zhang. Multi-source deep learning for information trustworthiness estimation. In KDD, 2013.

Digital Library

[17]

Alex Graves and Jürgen Schmidhuber. Framewise phoneme classification with bidirectional lstm and other neural network architectures. Neural networks, 18(5--6):602--610, 2005.

[18]

Ahmed Hassan, Rosie Jones, and Kristina Lisa Klinkner. Beyond dcg: user behavior as a predictor of a successful search. In WSDM, 2010.

Digital Library

[19]

Dan Hendrycks, Mantas Mazeika, Duncan Wilson, and Kevin Gimpel. Using trusted data to train deep networks on labels corrupted by severe noise. In NeurIPS, 2018.

[20]

Eric Horvitz. Principles of mixed-initiative user interfaces. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI '99, pages 159--166, New York, NY, USA, 1999. ACM.

Digital Library

[21]

Eric Horvitz, Andy Jacobs, and David Hovel. Attention-sensitive alerting. In UAI, 1999.

Digital Library

[22]

Meng Jiang, Peng Cui, Rui Liu, Qiang Yang, Fei Wang, Wenwu Zhu, and Shiqiang Yang. Social contextual recommendation. In CIKM, 2012.

Digital Library

[23]

Thorsten Joachims, Laura Granka, Bing Pan, Helene Hembrooke, Filip Radlinski, and Geri Gay. Evaluating the accuracy of implicit feedback from clicks and query reformulations in web search. TOIS, 25(2):7, 2007.

Digital Library

[24]

Anjuli Kannan, Karol Kurach, Sujith Ravi, Tobias Kaufmann, Andrew Tomkins, Balint Miklos, Greg Corrado, László Lukács, Marina Ganea, Peter Young, et al. Smart reply: Automated response suggestion for email. arXiv preprint arXiv:1606.04870, 2016.

[25]

Bryan Klimt and Yiming Yang. The enron corpus: A new dataset for email classification research. In ECML, 2004.

Digital Library

[26]

Farshad Kooti, Luca Maria Aiello, Mihajlo Grbovic, Kristina Lerman, and Amin Mantrach. Evolution of conversations in the age of email overload. In WWW, 2015.

Digital Library

[27]

M Pawan Kumar, Benjamin Packer, and Daphne Koller. Self-paced learning for latent variable models. In NeurIPS, 2010.

[28]

Andrew Lampert, Robert Dale, and Cecile Paris. Detecting emails containing requests for action. In HLT, 2010.

[29]

Andrew Lampert, Robert Dale, and Cecile Paris. Detecting emails containing requests for action. In NAACL, 2010.

[30]

Yuncheng Li, Jianchao Yang, Yale Song, Liangliang Cao, Jiebo Luo, and Li-Jia Li. Learning from noisy labels with distillation. In ICCV, 2017.

[31]

Chu-Cheng Lin, Dongyeop Kang, Michael Gamon, and Patrick Pantel. Actionable email intent modeling with reparametrized rnns. In AAAI, 2018.

[32]

Cheng Luo, Yukun Zheng, Jiaxin Mao, Yiqun Liu, Min Zhang, and Shaoping Ma. Training deep ranking model with weak relevance labels. In Zi Huang, Xiaokui Xiao, and Xin Cao, editors, Databases Theory and Applications, pages 205--216, Cham, 2017. Springer International Publishing.

[33]

Joel Mackenzie, Kshitiz Gupta, Fang Qiao, Ahmed Hassan Awadallah, and Milad Shokouhi. Exploring user behavior in email re-finding tasks. In WWW, 2019.

[34]

Yu Meng, Jiaming Shen, Chao Zhang, and Jiawei Han. Weakly-supervised hierarchical text classification. In AAAI, 2019.

[35]

Bhaskar Mitra, Fernando Diaz, and Nick Craswell. Learning to match using local and distributed representations of text for web search. In WWW, 2017.

Digital Library

[36]

Nagarajan Natarajan, Inderjit S Dhillon, Pradeep K Ravikumar, and Ambuj Tewari. Learning with noisy labels. In NeurIPS, 2013.

[37]

David F Nettleton, Albert Orriols-Puig, and Albert Fornells. A study of the effect of different types of noise on the precision of supervised learning techniques. Artificial intelligence review, 33(4):275--306, 2010.

[38]

Douglas Oard, William Webber, David Kirsch, and Sergey Golitsynskiy. Avocado research email collection. Philadelphia: Linguistic Data Consortium, 2015.

[39]

Wanli Ouyang, Xiao Chu, and Xiaogang Wang. Multi-source deep learning for human pose estimation. In CVPR, 2014.

Digital Library

[40]

Giorgio Patrini, Alessandro Rozza, Aditya Krishna Menon, Richard Nock, and Lizhen Qu. Making deep neural networks robust to label noise: A loss correction approach. In CVPR, 2017.

[41]

Jeffrey Pennington, Richard Socher, and Christopher Manning. Glove: Global vectors for word representation. In EMNLP, 2014.

[42]

Alexander Ratner, Stephen H Bach, Henry Ehrenberg, Jason Fries, Sen Wu, and Christopher Ré. Snorkel: Rapid training data creation with weak supervision. VLDB.

[43]

Alexander Ratner, Braden Hancock, Jared Dunnmon, Frederic Sala, Shreyash Pandey, and Christopher Ré. Training complex models with multi-task weak supervision. arXiv preprint arXiv:1810.02840, 2018.

[44]

Scott Reed, Honglak Lee, Dragomir Anguelov, Christian Szegedy, Dumitru Erhan, and Andrew Rabinovich. Training deep neural networks on noisy labels with bootstrapping. arXiv preprint arXiv:1412.6596, 2014.

[45]

Mengye Ren, Wenyuan Zeng, Bin Yang, and Raquel Urtasun. Learning to reweight examples for robust deep learning. arXiv preprint arXiv:1803.09050, 2018.

[46]

Maya Sappelli, Gabriella Pasi, Suzan Verberne, Maaike de Boer, and Wessel Kraaij. Assessing e-mail intent and tasks in e-mail messages. Information Sciences, 358:1--17, 2016.

Digital Library

[47]

Bahareh Sarrafzadeh, Ahmed Hassan Awadallah, Christopher H Lin, Chia-Jung Lee, Milad Shokouhi, and Susan T Dumais. Characterizing and predicting email deferral behavior. In WSDM, 2019.

Digital Library

[48]

Sainbayar Sukhbaatar, Joan Bruna, Manohar Paluri, Lubomir Bourdev, and Rob Fergus. Training convolutional networks with noisy labels. arXiv preprint arXiv:1406.2080, 2014.

[49]

Paroma Varma, Frederic Sala, Ann He, Alexander Ratner, and Christopher Ré. Learning dependency structures for weak supervision models. ICML, 2019.

[50]

Wei Wang, Saghar Hosseini, Ahmed Hassan Awadallah, Paul N. Bennett, and Chris Quirk. Context-aware intent identification in email conversations. In SIGIR, 2019.

Digital Library

[51]

Wei Wang, Saghar Hosseini, Ahmed Hassan Awadallah, Paul N Bennett, and Chris Quirk. Context-aware intent identification in email conversations. In SIGIR, 2019.

Digital Library

[52]

Liu Yang, Susan T Dumais, Paul N Bennett, and Ahmed Hassan Awadallah. Characterizing and predicting enterprise email reply behavior. In SIGIR, 2017.

Digital Library

[53]

Xiao Yang, Ahmed Hassan Awadallah, Madian Khabsa, Wei Wang, and Miaosen Wang. Characterizing and supporting question answering in human-to-human communication. In SIGIR, 2018.

Digital Library

[54]

Matthew D Zeiler. Adadelta: an adaptive learning rate method. arXiv preprint arXiv:1212.5701, 2012.

[55]

Chiyuan Zhang, Samy Bengio, Moritz Hardt, Benjamin Recht, and Oriol Vinyals. Understanding deep learning requires rethinking generalization, 2016.

Cited By

Farhour FAbdellatif AMansour EShihab E(2024)A Weak Supervision-Based Approach to Improve Chatbots for Code RepositoriesProceedings of the ACM on Software Engineering10.1145/36608121:FSE(2378-2401)Online publication date: 12-Jul-2024
https://dl.acm.org/doi/10.1145/3660812
Gupta RKrishnamoorthy MVangala V(2024)Better Graph Embeddings for Enterprise GraphsProceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD)10.1145/3632410.3632412(368-374)Online publication date: 4-Jan-2024
https://dl.acm.org/doi/10.1145/3632410.3632412
Li CZhang ZSaugstad MSafranchik EKulkarni CHuang XPatel SIyer VAlthoff TFroehlich J(2024)LabelAId: Just-in-time AI Interventions for Improving Human Labeling Quality and Domain Knowledge in Crowdsourcing SystemsProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642089(1-21)Online publication date: 11-May-2024
https://dl.acm.org/doi/10.1145/3613904.3642089
Show More Cited By

Index Terms

Learning with Weak Supervision for Email Intent Detection
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Information extraction
  2. Machine learning
    1. Learning settings
      1. Learning from implicit feedback

Recommendations

Few-shot Node Classification with Extremely Weak Supervision
WSDM '23: Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining

Few-shot node classification aims at classifying nodes with limited labeled nodes as references. Recent few-shot node classification methods typically learn from classes with abundant labeled nodes (i.e., meta-training classes) and then generalize to ...
Generalized Weak Supervision for Neural Information Retrieval
Neural ranking models (NRMs) have demonstrated effective performance in several information retrieval (IR) tasks. However, training NRMs often requires large-scale training data, which is difficult and expensive to obtain. To address this issue, one can ...
Neural Ranking Models with Weak Supervision
SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval

Despite the impressive improvements achieved by unsupervised deep neural networks in computer vision and NLP tasks, such improvements have not yet been observed in ranking for information retrieval. The reason may be the complexity of the ranking ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

SIGIR '20: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval

July 2020

2548 pages

ISBN:9781450380164

DOI:10.1145/3397271

General Chairs:
Jimmy Huang
York University, Canada
,
Yi Chang
Jilin University, China
,
Xueqi Cheng
Chinese Academy of Sciences, China
,
Program Chairs:
Jaap Kamps
University of Amsterdam, Netherlands
,
Vanessa Murdock
Amazon, U.S.A.
,
Ji-Rong Wen
Renmin University of China, China
,
Yiqun Liu
Tsinghua University, China

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 July 2020

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

SIGIR '20

Sponsor:

SIGIR

SIGIR '20: The 43rd International ACM SIGIR conference on research and development in Information Retrieval

July 25 - 30, 2020

Virtual Event, China

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

16
Total Citations
View Citations
406
Total Downloads

Downloads (Last 12 months)29
Downloads (Last 6 weeks)3

Reflects downloads up to 26 Sep 2024

Other Metrics

View Author Metrics

Citations

Cited By

Farhour FAbdellatif AMansour EShihab E(2024)A Weak Supervision-Based Approach to Improve Chatbots for Code RepositoriesProceedings of the ACM on Software Engineering10.1145/36608121:FSE(2378-2401)Online publication date: 12-Jul-2024
https://dl.acm.org/doi/10.1145/3660812
Gupta RKrishnamoorthy MVangala V(2024)Better Graph Embeddings for Enterprise GraphsProceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD)10.1145/3632410.3632412(368-374)Online publication date: 4-Jan-2024
https://dl.acm.org/doi/10.1145/3632410.3632412
Li CZhang ZSaugstad MSafranchik EKulkarni CHuang XPatel SIyer VAlthoff TFroehlich J(2024)LabelAId: Just-in-time AI Interventions for Improving Human Labeling Quality and Domain Knowledge in Crowdsourcing SystemsProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642089(1-21)Online publication date: 11-May-2024
https://dl.acm.org/doi/10.1145/3613904.3642089
Bommisetti RKodumagulla RSrinivas MNalkurti ANandurkar PHoang S(2024)Deep Learning based Predictive Analytics for Employees Turnover with Performance Prediction in Banking Sector2024 3rd International Conference on Sentiment Analysis and Deep Learning (ICSADL)10.1109/ICSADL61749.2024.00057(312-317)Online publication date: 13-Mar-2024
https://doi.org/10.1109/ICSADL61749.2024.00057
Bommisetti RKodumagulla RSrinivas MNalkurti ANandurkar PHoang S(2024)Deep Learning based Predictive Analytics for Employees Turnover with Performance Prediction in Banking Sector2024 International Conference on Inventive Computation Technologies (ICICT)10.1109/ICICT60155.2024.10544552(933-938)Online publication date: 24-Apr-2024
https://doi.org/10.1109/ICICT60155.2024.10544552
Bolla BPattnaik SPatra S(2024)Detection of Objectionable Song Lyrics Using Weakly Supervised Learning and Natural Language Processing TechniquesProcedia Computer Science10.1016/j.procs.2024.04.183235(1929-1942)Online publication date: 2024
https://doi.org/10.1016/j.procs.2024.04.183
Khandaker FSenderovich AZhao JCohen EYu ECarbajales SChan A(2024)Transformer models for mining intents and predicting activities from emails in knowledge-intensive processesEngineering Applications of Artificial Intelligence10.1016/j.engappai.2023.107450128(107450)Online publication date: Feb-2024
https://doi.org/10.1016/j.engappai.2023.107450
Shah HJaidka KUngar LFagan JGrosser T(2023)Building a Multimodal Classifier of Email Behavior: Towards a Social Network Understanding of Organizational CommunicationInformation10.3390/info1412066114:12(661)Online publication date: 14-Dec-2023
https://doi.org/10.3390/info14120661
Yuvaraj SBenatallah BMotahari-Nezhad HRabhi F(2023)Identification and Generation of Actions Using Pre-trained Language ModelsWeb Information Systems Engineering – WISE 202310.1007/978-981-99-7254-8_52(674-683)Online publication date: 21-Oct-2023
https://doi.org/10.1007/978-981-99-7254-8_52
Khandaker FSenderovich AYu ECarbajales SChan A(2023)Transformer Models for Activity Mining in Knowledge-Intensive ProcessesBusiness Process Management Workshops10.1007/978-3-031-25383-6_2(13-24)Online publication date: 9-Feb-2023
https://doi.org/10.1007/978-3-031-25383-6_2
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents