Literary and Linguistic Computing 2004 19(4):453-475; doi:10.1093/llc/19.4.453
© 2004 by Association for Literary & Linguistic Computing
Testing Burrows's Delta

David L. Hoover*

New York University, USA

Delta, a simple measure of the difference between two texts, has been proposed by John F. Burrows as a tool in authorship attribution problems, particularly in large ‘open’ problems in which conventional methods of attribution are not able to limit the claimants effectively. This paper tests Delta's effectiveness and accuracy, and shows that it works nearly as well on prose as it does on poetry. It also shows that using much larger numbers of frequent words yields even more accurate results than the 150 that Burrows tested. Automated methods that allow for tests on large numbers of differently selected words show that removing personal pronouns and words for which a single text supplies most of the occurrences greatly increases the accuracy of Delta tests. Further tests suggest that large changes in Delta and Delta z-scores from the likeliest to the second likeliest author typically characterize correct attributions, that differences in point of view among the texts are more significant than differences in nationality, and that combining several texts for each author in the primary set reduces the effect of intra-author variability. Although Delta occasionally produces errors in attribution with characteristics that would normally lead to a great deal of confidence, the results presented here confirm its usefulness in the preliminary stages of authorship attribution problems.
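The abstract does not define Delta, so the following is only a minimal sketch under stated assumptions. It takes Delta in its commonly cited form: the mean absolute difference between the z-scores of the most frequent words in a test text and in each candidate author's text, with means and standard deviations taken over the primary set. The function names, the `exclude` and `max_share` parameters, and the use of a population standard deviation are illustrative choices, not the paper's procedure. `exclude` stands in for removing personal pronouns, and `max_share` for culling words for which a single text supplies most of the occurrences.

```python
from collections import Counter
from statistics import mean, pstdev


def most_frequent_words(texts, n, exclude=frozenset(), max_share=1.0):
    """Pick up to n of the most frequent words across all texts.

    texts: dict mapping a text name to its list of tokens.
    exclude: words to skip (hypothetical stand-in for a pronoun list).
    max_share: skip a word if one text supplies more than this fraction
        of its occurrences (hypothetical threshold parameter).
    """
    per_text = {name: Counter(tokens) for name, tokens in texts.items()}
    total = Counter()
    for counts in per_text.values():
        total.update(counts)
    selected = []
    for word, count in total.most_common():
        if word in exclude:
            continue
        largest = max(counts[word] for counts in per_text.values())
        if largest / count > max_share:
            continue
        selected.append(word)
        if len(selected) == n:
            break
    return selected


def relative_freqs(tokens, vocab):
    """Relative frequency of each vocabulary word in one text."""
    counts = Counter(tokens)
    return {w: counts[w] / len(tokens) for w in vocab}


def delta(test_tokens, primary, vocab):
    """Return a dict mapping each author in the primary set to a Delta score.

    Assumes Delta is the mean absolute difference of z-scores; a lower
    score marks a likelier author. Words with zero variance across the
    primary set are skipped because their z-scores are undefined.
    """
    freqs = {a: relative_freqs(t, vocab) for a, t in primary.items()}
    test = relative_freqs(test_tokens, vocab)
    sums = {a: 0.0 for a in primary}
    used = 0
    for w in vocab:
        values = [freqs[a][w] for a in primary]
        mu, sigma = mean(values), pstdev(values)
        if sigma == 0:
            continue
        used += 1
        z_test = (test[w] - mu) / sigma
        for a in primary:
            sums[a] += abs(z_test - (freqs[a][w] - mu) / sigma)
    if used == 0:
        raise ValueError("no word varies across the primary set")
    return {a: s / used for a, s in sums.items()}


# Toy data, invented purely for illustration.
primary = {
    "A": "the the the a of".split(),
    "B": "the a a a of".split(),
}
vocab = most_frequent_words(primary, 3)
scores = delta("the the the a of".split(), primary, vocab)
likeliest = min(scores, key=scores.get)
```

In this toy run the test text is identical to author A's sample, so A receives the lowest score. Real use would involve far longer texts and word lists in the hundreds, and the gap between the lowest and second-lowest scores would be examined as well as the ranking.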


* Correspondence: David L. Hoover, Department of English, New York University, 19 University Place, 5th Floor, New York, NY 10003, USA. E-mail: david.hoover{at}nyu.edu

