Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2835776.2835811acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

You've got Mail, and Here is What you Could do With It!: Analyzing and Predicting Actions on Email Messages

Published: 08 February 2016 Publication History

Abstract

With email traffic increasing, leading Web mail services have started to offer features that assist users in reading and processing their inboxes. One approach is to identify "important" messages, while a complementary one is to bundle messages, especially machine-generated ones, in pre-defined categories. We rather propose here to go back to the task at hand and consider what actions the users might conduct on received messages. We thoroughly studied, in a privacy-preserving manner, the actions of a large number of users in Yahoo mail, and found out that the most frequent actions are typically read, reply, delete and a sub-type of delete, delete-without-read. We devised a learning framework for predicting these four actions, for users with various levels of activity per action. Our framework leverages both vertical learning for personalization and horizontal learning for regularization purposes. In order to verify the quality of our predictions, we conducted a large-scale experiment involving users who had previously agreed to participate in such research studies. Our results show that, for recall values of 90%, we can predict important actions such as read or reply at precision levels up to 40% for active users, which we consider pretty encouraging for an assistance task. For less active users, we show that our regularization achieves an increase in AUC of close to 50%. To the best of our knowledge, our work is the first to provide a unified framework of this scale for predicting multiple actions on Web email, which hopefully provides a new ground for inventing new user experiences to help users process their inboxes.

References

[1]
D. Aberdeen, O. Pacovsky, and A. Slater. The learning behind gmail priority inbox. In LCCC: NIPS 2010 Workshop on Learning on Cores, Clusters and Clouds. Elsevier, 2010.
[2]
N. Ailon, Z. Karnin, E. Liberty, and Y. Maarek. Threading machine generated email. In Proceedings of WSDM'2013, pages 405--414, New York, NY, USA, 2013.
[3]
I. Alberts and D. Forest. Email pragmatics and automatic classification: A study in the organizational context. JASIST, 63(5):904--922, 2012.
[4]
D. Arthur and S. Vassilvitskii. k-means++: The advantages of careful seeding. In Proceedings of SODA'2007, New Orleans, LA, Jan 2007.
[5]
T. Ayodele and S. Zhou. Applying machine learning techniques for e-mail management: solution with intelligent e-mail reply prediction. Journal of Engineering and Technology Research, 1(7):143--151, 2009.
[6]
S. Bickel and T. Scheffer. Dirichlet-enhanced spam filtering based on biased samples. Advances in neural information processing systems, 19:161, 2007.
[7]
W. Cohen. Learning rules that classify e-mail. In AAAI Spring Symposium on Machine Learning in Information Access, pages 18--25, 1996.
[8]
L. Dabbish, R. Kraut, S. Fussell, and S. Kiesler. Understanding email use: predicting action on a message. In Proceedings of CHI'2005, Portland,OR, Apr 2005.
[9]
L. A. Dabbish, G. Venolia, and J.J. Cadiz. Marked for deletion: An analysis of email data. In Proceedings of CHI'2003, Fort Lauderdale, FL, Apr 2003.
[10]
H. Daumé III. Frustratingly easy domain adaptation. arXiv preprint arXiv:0907.1815, 2009.
[11]
P. Deepak, D. Garg, and V. Varshney. Analysis of enron email threads and quantification of employee responsiveness. In Workshop on Text Mining and Link Analysis, 2007.
[12]
Y. Diao, H. Lu, and D. Wu. A comparative study of classification based personal e-mail filtering. In Proceedings of PAKDD'2000, Kyoto, Japan, Apr 2000.
[13]
M. Dredze, J. Blitzer, and F. Pereira. Reply expectation prediction for email management. In Proceedings of CEAS'2005, Stanford, CA, Jul 2005.
[14]
M. Dredze, T. Brooks, J. Carroll, J. Magarick, J. Blitzer, and F. Pereira. Intelligent email: Reply and attachment prediction. In Proceedings of IUI'2008, Canary Islands, Spain, Jan 2008.
[15]
J. Duffy. Google inbox for gmail (for iphone). PC Magazine, Oct 2014.
[16]
S. Dumais, E. Cutrell, JJ. Cadiz, G. Jancke, R. Sarin, and DC. Robbins. Stuff I've seen: A system for personal information retrieval and re-use. In Proceedings of SIGIR, pages 72--79, Toronto, Canada, 2003.
[17]
T. Evgeniou and M. Pontil. Regularized multi-task learning. In Proceedings of KDD'2004, Seattle, WA, Aug 2004.
[18]
M. Grbovic, G. Halawi, Z. Karnin, and Y. Maarek. How many folders do you really need? classifying email into a handful of categories. In Proceedings of CIKM'2014, 2014.
[19]
T. Hastie, J. Friedman, and R. Tibshirani. The elements of statistical learning: Data Mining, Inference, Prediction, volume 2. Springer, 2009.
[20]
S. Kiritchenko and S. Matwin. Email classification with co-training. In Proceedings of the 2011 Conference of the Center for Advanced Studies on Collaborative Research, pages 301--312, 2011.
[21]
B. Klimt and Y. Yang. The enron corpus: A new dataset for email classification research. In Proceedings of ECML'2004, Pisa, Italy, Sep 2004.
[22]
F. Kooti, L.M. Aiello, M. Grbovic, K. Lerman, and A. Mantrach. Evolution of conversations in the age of email overload. In Proceedings of the 24th International Conference on World Wide Web, WWW '15, pages 603--613, 2015.
[23]
Y. Koren, E. Liberty, Y. Maarek, and R. Sandler. Automatically tagging email by leveraging other users' folders. In Proceedings of KDD'2011, San Diego, CA, Aug 2011.
[24]
W. Mackay. Diversity in the use of electronic mail: A preliminary inquiry. ACM Transactions on Office Information Systems, 6(4):380--397, 1998.
[25]
C.D. Manning, P. Raghavan, and H. Schütze. Introduction to information retrieval, volume 1. Cambridge university press Cambridge, 2008.
[26]
Derek Mead. Aol's new alto client is visual email, and you don't need a new address. MotherBoard, Oct 2012.
[27]
B. On, E. Lim, A. Purandare, and L. Teow. Mining interaction behaviors for email reply order prediction. In Proceedings of ASONAM'2010, pages 306--310, 2010.
[28]
S.J. Pan and Q. Yang. A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering, 22(10):1345--1359, 2010.
[29]
P. Resnick. Internet message format, rfc 5322, October 2008.
[30]
M. Sappelli, S. Verberne, and W. Kraaij. Combining textual and non-textual features for e-mail importance estimation. In Proceedings of the 25th Benelux Conference on Artificial Intelligence, 2013.
[31]
S. Shalev-Shwartz. Online learning and online convex optimization. Foundations and Trends in Machine Learning, 4(2):107--194, 2011.
[32]
H. Shimodaira. Improving predictive inference under covariate shift by weighting the log-likelihood function. Journal of statistical planning and inference, 90(2), 2000.
[33]
L. Sproull and S. Kiesler. Connections: New ways of working in the networked organization. MIT Press: Cambridge, MA, USA, 1991.
[34]
W. Steve and S. Candace. Email overload: exploring personal information management of email. In Proceedings of CHI'96, Vancouver,CA, Apr 1996.
[35]
B. Taylor. Sender reputation in a large webmail service. In Proceedings of CEAS'2006, Mountain View, CA, Jul 2006.
[36]
J. Tyler and J. Tang. When can i expect an email response? a study of rhythms in email usage. In Proceedings of ECSCW'2003, pages 239--258, Helsinki, Finland, Sep 2003.
[37]
W.T. Yih, R. McCann, and A. Kolcz. Improving spam filtering by detecting gray mail. In Proceedings of CEAS'2007, Mountain View, CA, Aug 2007.

Cited By

View all
  • (2023)EmFore: Online Learning of Email Folder Classification RulesProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614863(2280-2290)Online publication date: 21-Oct-2023
  • (2023)“We Need a Big Revolution in Email Advertising”: Users’ Perception of Persuasion in Permission-based Advertising EmailsProceedings of the 2023 CHI Conference on Human Factors in Computing Systems10.1145/3544548.3581163(1-21)Online publication date: 19-Apr-2023
  • (2023)Predicting Email Opens with Domain-Sensitive Affect DetectionComputational Linguistics and Intelligent Text Processing10.1007/978-3-031-23804-8_6(71-79)Online publication date: 26-Feb-2023
  • Show More Cited By

Index Terms

  1. You've got Mail, and Here is What you Could do With It!: Analyzing and Predicting Actions on Email Messages

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WSDM '16: Proceedings of the Ninth ACM International Conference on Web Search and Data Mining
    February 2016
    746 pages
    ISBN:9781450337168
    DOI:10.1145/2835776
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 08 February 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. actions
    2. email
    3. email message
    4. predicting actions

    Qualifiers

    • Research-article

    Conference

    WSDM 2016
    WSDM 2016: Ninth ACM International Conference on Web Search and Data Mining
    February 22 - 25, 2016
    California, San Francisco, USA

    Acceptance Rates

    WSDM '16 Paper Acceptance Rate 67 of 368 submissions, 18%;
    Overall Acceptance Rate 414 of 2,352 submissions, 18%

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)26
    • Downloads (Last 6 weeks)4
    Reflects downloads up to 16 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)EmFore: Online Learning of Email Folder Classification RulesProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614863(2280-2290)Online publication date: 21-Oct-2023
    • (2023)“We Need a Big Revolution in Email Advertising”: Users’ Perception of Persuasion in Permission-based Advertising EmailsProceedings of the 2023 CHI Conference on Human Factors in Computing Systems10.1145/3544548.3581163(1-21)Online publication date: 19-Apr-2023
    • (2023)Predicting Email Opens with Domain-Sensitive Affect DetectionComputational Linguistics and Intelligent Text Processing10.1007/978-3-031-23804-8_6(71-79)Online publication date: 26-Feb-2023
    • (2022)Do patients read emails from their physician containing tips on improving lifestyle habits? A pilot studyInternational Journal of Medical Informatics10.1016/j.ijmedinf.2022.104967(104967)Online publication date: Dec-2022
    • (2021)Improving Cloud Storage Search with User ActivityProceedings of the 14th ACM International Conference on Web Search and Data Mining10.1145/3437963.3441780(508-516)Online publication date: 8-Mar-2021
    • (2021)“I Can’t Reply with That”: Characterizing Problematic Email Reply SuggestionsProceedings of the 2021 CHI Conference on Human Factors in Computing Systems10.1145/3411764.3445557(1-18)Online publication date: 6-May-2021
    • (2021)A Novel Approach to Control Emails Notification using NLPProcedia Computer Science10.1016/j.procs.2021.05.097189(224-231)Online publication date: 2021
    • (2020)Leveraging User Email Actions to Improve Ad-Close PredictionProceedings of the 29th ACM International Conference on Information & Knowledge Management10.1145/3340531.3412093(2293-2296)Online publication date: 19-Oct-2020
    • (2020)Why Johnny Can't Unsubscribe: Barriers to Stopping Unwanted EmailProceedings of the 2020 CHI Conference on Human Factors in Computing Systems10.1145/3313831.3376165(1-12)Online publication date: 21-Apr-2020
    • (2020)Prioritizing unread e-mails: people send urgent responses before important or short onesHuman–Computer Interaction10.1080/07370024.2020.1835481(1-24)Online publication date: 23-Nov-2020
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media