© 1996 by Association for Literary & Linguistic Computing
A hybrid disambiguation model for prepositional phrase attachment
Department of Computer Science, The University of Electro-Communication, Chofu, Tokyo, Japan Z Corresponding author
Prepositional Phrase (PP) attachment is a major cause of structural ambiguity in natural language. Many proposals have increasingly relied on large-scale corpus to resolve this problem. However, this approach encounters the notorious sparse-data problem that produces poor results on disambiguation. We in this paper offer a hybrid method which integrates corpus-based approach with knowledge-based techniques for PP attachment disambiguation. It explores a wide-variety of information, including co-occurrence frequencies from annotated corpora, conceptual relationships and conceptual features for a machine-readable dictionary, and syntactic clues from our linguistic observations. We use dictionary definitions and human knowledge to overcome the sparse-data problem. An experiment shows an accuracy rate of 87.7% of our method over 3043 sentences in real English text that contain ambiguous PPs. This result is better than those of any existing methods.