Literary and Linguistic Computing Advance Access originally published online on March 2, 2005
Literary and Linguistic Computing 2005 20(1):103-116; doi:10.1093/llc/fqh046
| ||||||||||||||||||||||||||||||||||||||||||||||||||
Articles |
Unification of XML Documents with Concurrent Markup
Bielefeld University, Bielefeld Justus-Liebig-Universität Gießen
Harald Lüngen, Justus-Liebig-Universität Gießen, FB05 Angewandte Sprachwissenschaft und Computerlinguistik, Otto-Behaghel-Str. 10 D, D-35394 Gießen. E-mail: luengen{at}uni-giessen.de
An approach to the unification of XML (Extensible Markup Language) documents with identical textual content and concurrent markup in the framework of XML-based multi-layer annotation is introduced. A Prolog program allows the possible relationships between element instances on two annotation layers that share PCDATA to be explored and also the computing of a target node hierarchy for a well-formed, merged XML document. Special attention is paid to identity conflicts between element instances, for which a default solution that takes into account metarelations that hold between element types on the different annotation layers is provided. In addition, rules can be specified by a user to prescribe how identity conflicts should be solved for certain element types.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. Witt, G. Rehm, E. Hinrichs, T. Lehmberg, and J. Stegmann SusTEInability of linguistic resources through feature structures Lit Linguist Computing, September 1, 2009; 24(3): 363 - 372. [Abstract] [Full Text] [PDF] |
||||
