Literary and Linguistic Computing Advance Access originally published online on October 27, 2005
Literary and Linguistic Computing 2006 21(3):311-326; doi:10.1093/llc/fqi063
| ||||||||||||||||||||||||||||||||||||||||||||||||||||
A Tool for Literary Studies: Intertextual Distance and Tree Classification
Grenoble I University, France
Grenoble II University, France
Correspondence: Cyril Labbé, LSR-IMAG, 681, rue de la Passerelle, BP. 72, 38402 Saint Martin d'Hères Cedex, France. E-mail: cyril.labbe{at}imag.fr
How to measure proximities and oppositions in large text corpora? Intertextual distance provides a simple and interesting solution. Its properties make it a good tool for text classification, and especially for tree-analysis which is fully presented and discussed here. In order to measure the quality of this classification, two indices are proposed. The method presented provides an accurate tool for literary studiesas is demonstrated by applying it to two areas of French literature, Racine's tragedies and an authorship attribution experiment.