Treebanking

From BioIE Wiki

Treebanking is a form of syntactic annotation. PennBioIE treebanking follows the guidelines of the Penn Treebank Project, with significant adaptations for biomedical vocabulary and entity annotation.
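As a rough illustration of what this kind of annotation looks like, the sketch below reads one bracketed tree written in the general Penn Treebank style and recovers its words. The example sentence and the helper names (parse_bracketed, leaves) are made up for illustration and are not taken from the PennBioIE data or tools; real treebank files also contain conventions, such as empty elements and function tags, that this sketch does not handle.

```python
def parse_bracketed(text):
    """Parse one bracketed tree into nested (label, children) tuples.

    Hypothetical helper for illustration only; not part of any PennBioIE tool.
    """
    # Pad brackets with spaces so a plain split() yields clean tokens.
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def read_node():
        nonlocal pos
        if tokens[pos] != "(":
            raise ValueError("expected '(' at token %d" % pos)
        pos += 1
        label = tokens[pos]          # syntactic category or POS tag
        pos += 1
        children = []
        while tokens[pos] != ")":
            if tokens[pos] == "(":
                children.append(read_node())   # nested constituent
            else:
                children.append(tokens[pos])   # a word (leaf)
                pos += 1
        pos += 1                     # consume the closing ')'
        return (label, children)

    tree = read_node()
    if pos != len(tokens):
        raise ValueError("unexpected text after the end of the tree")
    return tree


def leaves(tree):
    """Return the words of the tree in left-to-right order."""
    _, children = tree
    words = []
    for child in children:
        if isinstance(child, tuple):
            words.extend(leaves(child))
        else:
            words.append(child)
    return words


# Invented example sentence, bracketed with standard Penn Treebank labels.
example = "(S (NP (DT The) (NN protein)) (VP (VBZ binds) (NP (NN DNA))) (. .))"
tree = parse_bracketed(example)
print(tree[0])        # top-level label
print(leaves(tree))   # words in sentence order
```

Each pair of parentheses marks one constituent: the first token inside is its label (S, NP, VP, or a part-of-speech tag such as NN), and everything after it is either a word or a further nested constituent.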

Treebanking Guidelines

The Penn Treebank (English) guidelines:

  • Part-of-Speech Tagging Guidelines for the Penn Treebank Project (3rd Revision, 2nd printing)
  • Bracketing Guidelines for Treebank II Style Penn Treebank Project: A detailed description of the Penn Treebank style, with many examples
    • PDF
    • LaTeX* (FTP directory, with README text file)
  • BioMedical Annotation Addendum to the Penn Treebank II Style Bracketing Guidelines

* can be viewed with Ghostview

Reference Tools

The Database of Tags does not include treebanking data, but it lets you look up all the annotations in our v0.9 release (December 2004): entity, POS, sentence, section, paragraph, or file.

