Abstract
Automatic text classification techniques are applied to the problem of quantifying strength of characterization within plays, using a case study of works by four sample playwrights whose texts are freely available in machine-readable form. Strong characters are those whose speeches constitute homogeneous categories in comparison with other characters: their speeches are more attributable to themselves than to their play or their author.
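The notion of a speech being "more attributable" to its own character can be illustrated with a small sketch. The abstract does not state which features or similarity measure the chapter uses, so everything below is an assumption made for illustration: letter-bigram counts as features, cosine similarity as the measure, a leave-one-out comparison, and the hypothetical function names `bigrams`, `cosine`, and `self_attribution`. It compares characters only, not plays or authors.

```python
from collections import Counter
from math import sqrt


def bigrams(text):
    """Count overlapping letter bigrams in lower-cased text (assumed feature set)."""
    s = text.lower()
    return Counter(s[i:i + 2] for i in range(len(s) - 1))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (assumed measure)."""
    dot = sum(n * b[g] for g, n in a.items())
    norm = sqrt(sum(n * n for n in a.values())) * sqrt(sum(n * n for n in b.values()))
    return dot / norm if norm else 0.0


def self_attribution(speeches):
    """Map each character to the share of their speeches that are most
    similar to that character's own remaining speeches."""
    scores = {}
    for who, own in speeches.items():
        hits = 0
        for i, speech in enumerate(own):
            probe = bigrams(speech)
            best, best_sim = None, -1.0
            for other, texts in speeches.items():
                # Hold the probe speech out of its own character's profile.
                rest = [t for j, t in enumerate(texts) if other != who or j != i]
                profile = Counter()
                for t in rest:
                    profile.update(bigrams(t))
                sim = cosine(probe, profile)
                if sim > best_sim:
                    best, best_sim = other, sim
            hits += best == who
        scores[who] = hits / len(own) if own else 0.0
    return scores
```

Under these assumptions, a character whose score is close to 1.0 would count as strongly characterized, while a character whose speeches are routinely closer to someone else's would score near 0.0. A fuller treatment would add play-level and author-level categories to the comparison, as the abstract describes.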
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vogel, C., Lynch, G. (2008). Computational Stylometry: Who’s in a Play? In: Esposito, A., Bourbakis, N.G., Avouris, N., Hatzilygeroudis, I. (eds) Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction. Lecture Notes in Computer Science, vol 5042. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70872-8_13
DOI: https://doi.org/10.1007/978-3-540-70872-8_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70871-1
Online ISBN: 978-3-540-70872-8
eBook Packages: Computer Science, Computer Science (R0)