© 2002 by Association for Literary & Linguistic Computing
Delta: a Measure of Stylistic Difference and a Guide to Likely Authorship
University of Newcastle, Australia
This paper is a companion to my 'Questions of authorship: attribution and beyond', in which I sketched a new way of using the relative frequencies of the very common words for comparing written texts and testing their likely authorship. The main emphasis of that paper was not on the new procedure but on the broader consequences of our increasing sophistication in making such comparisons and the increasing (although never absolute) reliability of our inferences about authorship. My present objects, accordingly, are to give a more complete account of the procedure itself; to report the outcome of an extensive set of trials; and to consider the strengths and limitations of the new procedure. The procedure offers a simple but comparatively accurate addition to our current methods of distinguishing the most likely author of texts exceeding about 1,500 words in length. It is of even greater value as a method of reducing the field of likely candidates for texts of as little as 100 words in length. Not unexpectedly, it works least well with texts of a genre uncharacteristic of their author and, in one case, with texts far separated in time across a long literary career. Its possible use for other classificatory tasks has not yet been investigated.
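The abstract does not spell out the computation. As the measure is usually described, Delta is the mean absolute difference between the z-scores of the most common words in two texts, where the z-scores are taken against a reference set of texts. The following is a minimal sketch under that assumption, not the paper's own implementation. The function names `relative_freqs` and `delta`, the whitespace tokenizer, and the use of the population standard deviation are all illustrative choices.

```python
from collections import Counter
from statistics import mean, pstdev


def relative_freqs(text, vocab):
    """Relative frequency of each vocab word in text (naive whitespace tokens)."""
    tokens = text.lower().split()
    counts = Counter(tokens)
    total = len(tokens) or 1  # avoid division by zero on empty input
    return [counts[w] / total for w in vocab]


def delta(test_text, candidate_text, reference_texts, vocab):
    """Mean absolute difference of z-scores over the words in vocab.

    Assumption: means and standard deviations are estimated from
    reference_texts; words with zero variance there are skipped.
    """
    ref = [relative_freqs(t, vocab) for t in reference_texts]
    test = relative_freqs(test_text, vocab)
    cand = relative_freqs(candidate_text, vocab)

    diffs = []
    for i in range(len(vocab)):
        column = [row[i] for row in ref]
        mu, sigma = mean(column), pstdev(column)
        if sigma == 0:
            continue  # word carries no information in this reference set
        z_test = (test[i] - mu) / sigma
        z_cand = (cand[i] - mu) / sigma
        diffs.append(abs(z_test - z_cand))

    if not diffs:
        raise ValueError("no vocab word varies across the reference texts")
    return mean(diffs)
```

In use, `vocab` would hold the most frequent words of the reference set, and the candidate with the lowest score against the test text would be treated as the least unlikely author. The score ranks candidates; it does not by itself establish authorship.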