Skip Navigation

Literary and Linguistic Computing 1999 14(1):11-28; doi:10.1093/llc/14.1.11
© 1999 by Association for Literary & Linguistic Computing
This Article
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Andreev, L
Right arrow Articles by Olsen, M
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Re-engineering a war-machine: ARTFL's Encyclopédie

L AndreevA1, J IversonA2 and M OlsenZ

A1 Harvard University, Cambridge, MA, USA A2 University of Missouri-Columbia, Columbia, MO, USA Z Corresponding author at: ARTFL Project, Department of Romance Languages, University of Chicago, Chicago, IL 60637, USA. E-mail: mark@barkov.uchicago.edu

Current circumstances, specifically (i) competition from commercial developers and (ii) the need for compatibility between disparate data sources, suggest that it is crucially important to reconsider the ways in which the guidelines of the Text Encoding Initiative (TEI) are applied to computing projects in the humanities. Based on our work with Diderot and d'Alembert's Encycloédie, we contend that, in many cases light, automatically generated tagging is preferable to extensive manual mark-up. The massive size nd complex textual structures of this work made it imperative to devise procedures that would eliminate the need for hand editing. Data capture for this project was limited to clear typographic conventions using HTML conventions and simplified SGML-style tags. All identification of textual units (such as articles and cross-references) and textual attributes (such as authorship and subject headings) was then carried out automatically. The resulting hierarchical units (articles works, paragraphs) can be queried in diverse ways using systems developed by the ARTFL Project. These systems provide full-test retrieval and full-text searching, either in the full text or in a sub-corpus defined by the user. The proven viability of these procedures leads us to assert that this model could be applied profitably to a much wider range of projects where cost-effectiveness and flexibility are desirable.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.