Abstract
A new system for spam e-mail annotation by end-users is presented. It is based on the recursive application of hand-written annotation rules by means of an inferential engine based on Logic Programming. Annotation rules allow the user to express nuanced considerations that depend on deobfuscation, word (non-)occurrence and structure of the message in a straightforward, human-readable syntax. We show that a sample collection of annotation rules are effective on a relevant corpus that we have assembled by collecting e-mails that have escaped detection by the industry-standard SpamAssassin filter. The system presented here is intended as a personal tool enforcing personalized annotation rules that would not be suitable for the general e-mail traffic.
A companion Web site to this article, with software, results and the corpus described herewith is at http://informatica.unime.it/rubast/
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Sergeant, M.: Internet-level spam detection and spamassassin 2.50. In: Spam Conference (2003)
Denti, E., Omicini, A., Ricci, A.: Multi-paradigm java-prolog integration in tuprolog. Sci. Comput. Program. 57(2), 217–250 (2005)
Wielemaker, J., Anjewierden, A.: An architecture for making object-oriented systems available from prolog. In: Proc. of the 12th Int’l Workshop on Logic Programming Environments, WLPE 2002 (2002)
Cormack, G.V., Lynam, T.R.: Online supervised spam filter evaluation. ACM Trans. Inf. Syst. 25(3) (2007)
van Rijsbergen, C.J.: Information Retrieval, 2nd edn. Butterworths, London (1979)
Cormack, G.V., Lynam, T.R.: Spam corpus creation for trec. In: Proc. of the Second Conference on Email and Anti-Spam, CEAS 2005 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fiumara, G., Marchi, M., Pagano, R., Provetti, A. (2010). Rule-Based Spam E-mail Annotation. In: Hitzler, P., Lukasiewicz, T. (eds) Web Reasoning and Rule Systems. RR 2010. Lecture Notes in Computer Science, vol 6333. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15918-3_21
Download citation
DOI: https://doi.org/10.1007/978-3-642-15918-3_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15917-6
Online ISBN: 978-3-642-15918-3
eBook Packages: Computer ScienceComputer Science (R0)