© 1999 by Association for Literary & Linguistic Computing
Word length frequency and distribution in English: part II. An empirical and mathematical examination of the character and consequences of isometric lineation
Faculty of Integrated Human Studies, Kyoto University, Kyoto 606-8501, Japan Z Corresponding author E-mail: aoyama@phys.h.Kyoto-u.ac.jp
In this paper we build on earlier observations and theory regarding word length frequency and sequential distribution to develop a mathematical characterization of some of the language features distinguishing isometrically lineated text from unlineated text, in other words the features distinguishing isometrical verse from prose. It is shown that the frequency Qn of n syllables making complete words produces a flat distribution for prose, whereas that for verse exhibits peaks at the line length position and subsequent multiples of that position. Data from several verse authors are presented, including a detailed mathematical analysis of the dynamics underlying Qn peak creation, and comments are offered on the processes by which authors construct lines. We note that the word length sequence of prose is random, whereas lineation necessitates non-random word length sequencing, and that this has the probable consequence of introducing a degree of randomness into the otherwise highly ordered grammatical sequence. In addition, we observe that this effect can be ameliorated by a reduction in the mean word length of the text (confirming earlier empirical observations that verse tends to use shorter words than would otherwise have been selected), and also by the use of lines varying from the core isometrical set. The frequency of variant lines is shown to be coincident with the frequency of polysyllables, suggesting that the use of variants is motivated by polysyllabic word placement. The restrictiveness effects of different line lengths, the relationship between metrical restriction and poetic effect, and the general character of metrical rules are also discussed.