Skip Navigation


Literary and Linguistic Computing Advance Access originally published online on September 8, 2006
Literary and Linguistic Computing 2006 21(4):507-527; doi:10.1093/llc/fql040
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
21/4/507    most recent
fql040v2
fql040v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Manni, F.
Right arrow Articles by Nerbonne, J.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press on behalf of ALLC and ACH. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

To What Extent are Surnames Words? Comparing Geographic Patterns of Surname and Dialect Variation in the Netherlands

Franz Manni

UMR 5145 CNRS, Musée de l'Homme MNHN, Paris, France

Wilbert Heeringa and John Nerbonne

Alfa-Informatica, Faculty of Arts, University of Groningen, The Netherlands

Correspondence: John Nerbonne, P.O. Box 716, Faculty of Arts, Groningen NL 9700AS, The Netherlands. E-mail: j.nerbonne{at}rug.nl
Since the early studies by Sokal (1988) and Cavalli-Sforza et al. (1989), there has been an increasing interest in depicting the history of human migrations by comparing genetic and linguistic differences that mirror different aspects of human history. Most of the literature concerns continental or macroregional patterns of variation, while regional and microregional scales were investigated less successfully. In this article we concentrate on the Netherlands, an area of only 40,000 km2.

The focus of the article is on the analysis of surnames, which have been proven to be reliable genetic markers since in patrilineal systems they are transmitted—virtually unchanged—along generations, similar to a genetic locus on the Y-chromosome. We shall compare their distribution to that of dialect pronunciations, which are clearly culturally transmitted (children learn one of the linguistic varieties they are exposed to, normally that of their peers in the same area or that of their families). Since surnames, at the time of their introduction, were words subject to the same linguistic processes that otherwise result in dialect differences, one might expect the distribution of surnames to be correlated with dialect pronunciation differences. But we shall argue that once the collinear effects of geography on both genetics and cultural transmission are taken into account, there is in fact no statistically significant association between the two. We show that surnames cannot be taken as a proxy for dialect variation, even though they can be safely used as a proxy to Y-chromosome genetic variation.

We work primarily with regression analyses, which show that both surname and dialect variation are strongly correlated with geographic distance. In view of this strong correlation, we focus on the residuals of the regression, which seeks to explain genetic and linguistic variation on the basis of geography (where geographic distance is the independent variable, and surname diversity or linguistic diversity is the dependent variable). We then seek a more detailed portrait of the geographic patterns of variation by identifying the ‘barriers’ (namely the areas where the residuals are greatest) by applying the Monmonier algorithm.

We find the results historically and geographically insightful, hopefully leading to a deeper understanding of the role of the local migrations and cultural diffusion that are responsible for surname and dialect diversity.


1 Isonymy, a measure of surnames’ overlap in a population, estimates the degree to which the population is related, i.e. its consanguinity. Real isonymy is obtained by counting the number of marriages where partners have the same surname (isonymic marriages). Isonymy can be estimated by computing the probabilities of isonymic marriages for all surnames. The probability depends on the relative frequency and the number of all the different surnames. This latter measure is called ‘random isonymy’ and assumes that the choice of the partner is not influenced by his family name, being—in this respect—completely random. In a village where all the inhabitants have different surnames, isonymy is 0; in another village where all the inhbitants have the same surname, isonymy is 1.

2 Special-purpose software, "Barrier 2.2", is available at http://www.mnhn.fr/mnhn/ecoanthropologie/software/barrier.html

3 To avoid a frequent misunderstanding, we note that the edge associated with the maximum distance does not need to be on the borders of the triangulation, such a case more an exception than the rule. If this is the case, the extension of the boundary occurs in one direction only; otherwise, it takes place in two directions simultaneously.

4 Here we adopt the standard scientific notation where (*) means a significance level of 5%, while (**) and (***), respectively, indicate significance levels of at 1% and 0.1%.

5 Population genetic differences heavily depend on demographic phenomena.

6 This claim specially applies to the rural Netherlands. It must be emphasised that in middle of the seventeenth century a large proportion of the Dutch population was already living in towns and cities.

7 By patrilocality we mean a residential pattern in which a married couple settles in the husband's home or community.

8 By the way, this is the reason why the scientific definition of races does not apply to humans; they are too similar to be partitioned into separate, biologically meaningful groups.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Lit Linguist ComputingHome page
H. Goebl
Recent Advances in Salzburg Dialectometry
Lit Linguist Computing, November 1, 2006; 21(4): 411 - 435.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.