Real World Auto-Tagging of Scientific Text


Authors:

Abstract:

The relevant error rate for practical part-of-speech tagging systems is lower than the figures cited in the literature and yet remains a major factor in information extraction. We identify a source of non-trivial error and present a tool which can improve tagging accuracy. We also propose a characterization of serious and non-serious part-of-speech tagging errors.

Keywords:

Part of speech tagging, neural networks, defining serious error