Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1007/978-3-030-03493-1_35guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Weighted Voting and Meta-Learning for Combining Authorship Attribution Methods

Published: 21 November 2018 Publication History

Abstract

Our research concentrates on ways to combine machine learning techniques for authorship attribution. Traditionally, research in authorship attribution is focused on the development of new base-classifiers (combinations of stylometric features and learning methods). A large number of base-classifiers developed for authorship attribution vary in accuracy, often proposing different authors for a disputed document. In this research, we use predictions of multiple base-classifiers as a knowledge base for learning the true author.
We introduce and compare two novel methods that utilize multiple base-classifiers. In the Weighted Voting approach, each base-classifier supports an author in proportion to its accuracy in leave-one-out classification. In our Meta-Learning approach, each base-classifier is treated as a feature and methods’ predictions in leave-one-out cross-validation are used as training data from which machine learning methods produce an aggregated decision.
We illustrate our results through a collection of 18th century political writings. Anonymously written essays were common during this period, leading to frequent disagreements between scholars over their attribution.

References

[1]
Love H Attributing Authorship: An Introduction 2002 Cambridge Cambridge University Press
[2]
Stamatatos E Authorship attribution based on feature set subspacing ensembles Int. J. Artif. Intell. Tools 2006 15 823-838
[3]
Ryan, M., Noecker, J.: Mixture of Experts Authorship Attribution Notebook for PAN at CLEF 2012 (2012)
[4]
Berton G, Petrovic S, Ivanov L, and Schiaffino R Cleary S and Stabell IL Examining the Thomas Paine Corpus: automated computer authorship attribution methodology applied to Thomas Paine’s writings New Directions in Thomas Paine Studies 2016 New York Palgrave Macmillan US 31-47
[5]
Petrovic S, Berton G, Campbell S, and Ivanov L Attribution of 18th century political writings using machine learning J. Technol. Soc. 2015 11 1-13
[6]
Stamatatos E A survey of modern authorship attribution methods J. Am. Soc. Inf. Sci. Technol. 2009 60 538-556
[7]
Koppel M, Schler J, and Argamon S Computational methods in authorship attribution J. Am. Soc. Inf. Sci. Technol. 2008 60 9-26
[8]
Mosteller, F., Wallace, D.L.: Inference and disputed authorship: the Federalist. Center for the Study of Language and Information (1964)
[9]
Toutanova, K., Klein, D., Manning, C.D., Singer, Y.: Feature-rich part-of-speech tagging with a cyclic dependency network. In: NAACL 2003, pp. 173–180. Association for Computational Linguistics, Morristown (2003)
[10]
Balota DA, Yap MJ, Cortese MJ, et al. The English lexicon project Behav. Res. Methods 2007 39 445-459
[11]
Porter MF An algorithm for suffix stripping Program 1980 14 130-137
[12]
Hall M, Frank E, Holmes G, et al. The WEKA data mining software ACM SIGKDD Explor. Newsl. 2009 11 10
[13]
Juola Patrick Authorship Attribution Foundations and Trends® in Information Retrieval 2007 1 3 233-334

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
Intelligent Data Engineering and Automated Learning – IDEAL 2018: 19th International Conference, Madrid, Spain, November 21–23, 2018, Proceedings, Part I
Nov 2018
889 pages
ISBN:978-3-030-03492-4
DOI:10.1007/978-3-030-03493-1

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 21 November 2018

Author Tags

  1. Authorship attribution
  2. Combining classifiers
  3. Meta-Learning

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Nov 2024

Other Metrics

Citations

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media