© 1996 by Association for Literary & Linguistic Computing
Outside the cave of shadows: using syntactic annotation to enhance authorship attribution
A1 Catholic University of Nijmegen, Nijmegen, The Netherlands A The University of the West of England, Bristol, UK Z Corresponding author at: Max Planck Institute for Psycholinguistics, Wundtlaan 1, 6525XD, Nijmegen, The Netherlands
This paper reports an experiment in authorship attribution in which statistical measures and methods that have been widely applied to words and their frequencies of use are applied to rewrite rules as they appear in a syntactically annotated corpus. The outcome of this experiment suggests that the frequencies with which syntactic rewrite rules are put to use provide a better clue to authorship than word usage. Complementary methods focusing on the high-frequency head and the low-frequency tail of the distribution independently reveal a higher resolution than traditional word-based analyses, and promise enhanced accuracy for authorship attribution.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
K. Luyckx Corpus Linguistics Beyond the Word-Corpus Research from Phrase to Discourse. * Eileen Fitzpatrick (ed.). Lit Linguist Computing, April 1, 2008; 23(1): 123 - 125. [Full Text] [PDF] |
||||
![]() |
K. Altintas, F. Can, and J. M. Patton Language Change Quantification Using Time-separated Parallel Translations Lit Linguist Computing, November 1, 2007; 22(4): 375 - 393. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Hirst and O. Feiguina Bigrams of Syntactic Labels for Authorship Discrimination of Short Texts Lit Linguist Computing, November 1, 2007; 22(4): 405 - 417. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Tambouratzis and M. Vassiliou Employing Thematic Variables for Enhancing Classification Accuracy Within Author Discrimination Experiments Lit Linguist Computing, June 1, 2007; 22(2): 207 - 224. [Abstract] [Full Text] [PDF] |
||||
