Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3041021.3055166acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Email Category Prediction

Published: 03 April 2017 Publication History

Abstract

According to recent estimates, about 90% of consumer received emails are machine-generated. Such messages include shopping receipts, promotional campaigns, newsletters, booking confirmations, etc. Most such messages are created by populating a fixed template with a small amount of personalized information, such as name, salutation, reservation numbers, dates, etc. Web mail providers (Gmail, Hotmail, Yahoo) are leveraging the structured nature of such emails to extract salient information and use it to improve the user experience: e.g. by automatically entering reservation data into a user calendar, or by sending alerts about upcoming shipments. To facilitate these extraction tasks it is helpful to classify templates according to their category, e.g. restaurant reservations or bill reminders, since each category triggers a particular user experience.
Recent research has focused on discovering the causal thread of templates, e.g. inferring that a shopping order is usually followed by a shipping confirmation, an airline booking is followed by a confirmation and then by a "ready to check in" message, etc. Gamzu et al. took this idea one step further by implementing a method to predict the template category of future emails for a given user based on previously received templates. The motivation is that predicting future emails has a wide range of potential applications, including better user experiences (e.g. warning users of items ordered but not shipped), targeted advertising (e.g. users that recently made a flight reservation may be interested in hotel reservations), and spam classification (a message that is part of a legitimate causal thread is unlikely to be spam).
The gist of the Gamzu et al. approach is modeling the problem as a Markov chain, where the nodes are templates or temporal events (e.g. the first day of the month). This paper expands on their work by investigating the use of neural networks for predicting the category of emails that will arrive during a fixed-sized time window in the future. We consider two types of neural networks: multi-layer perceptrons (MLP), a type of feedforward neural network; and long short-term memory (LSTM), a type of recurrent neural network. For each type of neural network, we explore the effects of varying their configuration (e.g. number of layers or number of neurons) and hyper-parameters (e.g. drop-out ratio). We find that the prediction accuracy of neural networks vastly outperforms the Markov chain approach, and that LSTMs perform slightly better than MLPs. We offer some qualitative interpretation of our findings and identify some promising future directions.

References

[1]
M. Aery and S. Chakravarthy. eMailSift: Email classification based on structure and content. In 5th IEEE International Conference on Data Mining, pages 1--8, 2005.
[2]
N. Ailon, Z. S. Karnin, E. Liberty, and Y. Maarek. Threading machine generated email. In 6th ACM International Conference on Web Search and Data Mining, pages 405--414, 2013.
[3]
Z. Bar-Yossef and N. Kraus. Context-sensitive query auto-completion. In 20th International Conference on World Wide Web, pages 107--116, 2011.
[4]
P. Bermejo, J. A. Gámez, and J. M. Puerta. Improving the performance of naive bayes multinomial in e-mail foldering by introducing distribution-based balance of datasets. Expert Systems with Applications, 38(3):2072--2080, 2011.
[5]
J. D. Brutlag and C. Meek. Challenges of the email domain for text classification. In 8th International Conference on Machine Learning, pages 103--110, 2000.
[6]
S. Chakravarthy, A. Venkatachalam, and A. Telang. A graph-based approach for multi-folder email classification. In 10th IEEE International Conference on Data Mining, pages 78--87, 2010.
[7]
W. W. Cohen. Learning rules that classify e-mail. In 1996 AAAI Spring Symposium on Machine Learning in Information Access, pages 18--25, 1996.
[8]
D. Di Castro, Z. Karnin, L. Lewin-Eytan, and Y. Maarek. You've got mail, and here is what you could do with it!: Analyzing and predicting actions on email messages. In 9th ACM International Conference on Web Search and Data Mining, pages 307--316, 2016.
[9]
I. Gamzu, Z. Karnin, Y. Maarek, and D. Wajc. You will get mail! Predicting the arrival of future email. In 24th International Conference on World Wide Web, pages 1327--1332, 2015.
[10]
I. Goodfellow, Y. Bengio, and A. Courville. Deep learning. Book in preparation for MIT Press, 2016.
[11]
A. Graves. Neural networks. In Supervised Sequence Labelling with Recurrent Neural Networks, pages 15--35. Springer, 2012.
[12]
M. Grbovic, G. Halawi, Z. Karnin, and Y. Maarek. How many folders do you really need?: Classifying email into a handful of categories. In 23rd ACM International Conference on Conference on Information and Knowledge Management, pages 869--878, 2014.
[13]
R. Gupta, G. Liang, H.-P. Tseng, R. K. Holur Vijay, X. Chen, and R. Rosales. Email volume optimization at LinkedIn. In 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 97--106, 2016.
[14]
S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural computation, 9(8):1735--1780, 1997.
[15]
J.-Y. Jiang, Y.-Y. Ke, P.-Y. Chien, and P.-J. Cheng. Learning user reformulation behavior for query auto-completion. In 37th International ACM SIGIR Conference on Research & Development in Information Retrieval, pages 445--454, 2014.
[16]
A. Kannan, K. Kurach, S. Ravi, T. Kaufmann, A. Tomkins, B. Miklos, G. Corrado, L. Lukács, M. Ganea, P. Young, and V. Ramavajjala. Smart reply: Automated response suggestion for email. In 22nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pages 495--503, 2016.
[17]
D. Kingma and J. Ba. Adam: A method for stochastic optimization. arXiv preprint:1412.6980, 2014.
[18]
S. Kiritchenko and S. Matwin. Email classification with co-training. In 2001 Conference of the Center for Advanced Studies on Collaborative Research, page 8, 2001.
[19]
B. Klimt and Y. Yang. The Enron corpus: A new dataset for email classification research. In 15th European Conference on Machine Learning, pages 217--226, 2004.
[20]
Y. Koren, E. Liberty, Y. Maarek, and R. Sandler. Automatically tagging email by leveraging other users' folders. In 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 913--921, 2011.
[21]
Y. Maarek. Is mail the next frontier in search and data mining? In 9th ACM International Conference on Web Search and Data Mining, pages 203--203, 2016.
[22]
V. Pham, T. Bluche, C. Kermorvant, and J. Louradour. Dropout improves recurrent neural networks for handwriting recognition. In 14th International Conference on Frontiers in Handwriting Recognition, pages 285--290, 2014.
[23]
L. Prechelt. Automatic early stopping using cross validation: quantifying the criteria. Neural Networks, 11(4):761--767, 1998.
[24]
J. Provost. Naïve-bayes vs. rule-learning in classification of email. University of Texas at Austin, 1999.
[25]
W. J. Stewart. Probability, Markov Chains, Queues, and Simulation: The Mathematical Basis of Performance Modeling. Princeton University Press, Princeton, NJ, USA, 2009.
[26]
B. Wang, M. Ester, J. Bu, Y. Zhu, Z. Guan, and D. Cai. Which to view: Personalized prioritization for broadcast emails. In 25th International Conference on World Wide Web, pages 1181--1190, 2016.
[27]
B. Wang, M. Ester, Y. Liao, J. Bu, Y. Zhu, Z. Guan, and D. Cai. The million domain challenge: Broadcast email prioritization by cross-domain recommendation. In 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1895--1904, 2016.
[28]
J. B. Wendt, M. Bendersky, L. Garcia-Pueyo, V. Josifovski, B. Miklos, I. Krka, A. Saikia, J. Yang, M.-A. Cartright, and S. Ravi. Hierarchical label propagation and discovery for machine generated email. In 9th ACM International Conference on Web Search and Data Mining, pages 317--326, 2016.
[29]
K. Yao, G. Zweig, M.-Y. Hwang, Y. Shi, and D. Yu. Recurrent neural networks for language understanding. In 14th Annual Conference of the International Speech Communication Association, pages 2524--2528, 2013.
[30]
W. Zaremba, I. Sutskever, and O. Vinyals. Recurrent neural network regularization. arXiv preprint:1409.2329, 2014.
[31]
A. Zhang, A. Goyal, R. Baeza-Yates, Y. Chang, J. Han, C. A. Gunter, and H. Deng. Towards mobile query auto-completion: An efficient mobile application-aware approach. In 25th International Conference on World Wide Web, pages 579--590, 2016.
[32]
W. Zhang, A. Ahmed, J. Yang, V. Josifovski, and A. J. Smola. Annotating needles in the haystack without looking: Product information extraction from emails. In 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 2257--2266, 2015.

Cited By

View all
  • (2024)Prediction of metal recovery potential of end-of-life NEV batteries in China based on GRA-BiLSTMWaste Management10.1016/j.wasman.2024.10.002190(339-349)Online publication date: Dec-2024
  • (2019)Clinical text classification research trends: Systematic literature review and open issuesExpert Systems with Applications10.1016/j.eswa.2018.09.034116(494-520)Online publication date: Feb-2019
  • (2018)Beacon in the DarkProceedings of the 27th ACM International Conference on Information and Knowledge Management10.1145/3269206.3269231(1871-1874)Online publication date: 17-Oct-2018
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '17 Companion: Proceedings of the 26th International Conference on World Wide Web Companion
April 2017
1738 pages
ISBN:9781450349147

Sponsors

  • IW3C2: International World Wide Web Conference Committee

In-Cooperation

Publisher

International World Wide Web Conferences Steering Committee

Republic and Canton of Geneva, Switzerland

Publication History

Published: 03 April 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. email
  2. prediction
  3. time series analysis

Qualifiers

  • Research-article

Conference

WWW '17
Sponsor:
  • IW3C2

Acceptance Rates

WWW '17 Companion Paper Acceptance Rate 164 of 966 submissions, 17%;
Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)20
  • Downloads (Last 6 weeks)2
Reflects downloads up to 18 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Prediction of metal recovery potential of end-of-life NEV batteries in China based on GRA-BiLSTMWaste Management10.1016/j.wasman.2024.10.002190(339-349)Online publication date: Dec-2024
  • (2019)Clinical text classification research trends: Systematic literature review and open issuesExpert Systems with Applications10.1016/j.eswa.2018.09.034116(494-520)Online publication date: Feb-2019
  • (2018)Beacon in the DarkProceedings of the 27th ACM International Conference on Information and Knowledge Management10.1145/3269206.3269231(1871-1874)Online publication date: 17-Oct-2018
  • (2018)Hidden in Plain SightProceedings of the 2018 World Wide Web Conference10.1145/3178876.3186167(1865-1874)Online publication date: 10-Apr-2018
  • (2018)A Framework for Automatic Generation of FAQs from Email Repositories2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)10.1109/IEMCON.2018.8614894(1035-1041)Online publication date: Nov-2018
  • (2018)Learning Effective Embeddings for Machine Generated Emails with Applications to Email Category Prediction2018 IEEE International Conference on Big Data (Big Data)10.1109/BigData.2018.8622048(1846-1855)Online publication date: Dec-2018
  • (2018)Template Trees: Extracting Actionable Information from Machine Generated EmailsDatabase and Expert Systems Applications10.1007/978-3-319-98812-2_1(3-18)Online publication date: 9-Aug-2018
  • (2017)Data mining on social networks for target advertisement in automobile sector2017 International Conference on Infocom Technologies and Unmanned Systems (Trends and Future Directions) (ICTUS)10.1109/ICTUS.2017.8285998(166-170)Online publication date: Dec-2017

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media