Email Contact: bioie@ldc.upenn.edu
This material is based upon work supported by the National Science Foundation under Grant No.: EIA-0205448
Mining the Bibliome
 
Annotation Database
 

12-2004 release
previous release

User's Guide

The fields allow you to restrict the search. Shift-click to select multiple contiguous list entries; control-click to select multiple individual entries or to deselect a selected entry. Each string found satisfying the search requirements is displayed on a separate line.

  • File Set: Select the domain(s) you are interested in. "12-04" = version 0.9.

  • Annotation Type: (The label of an annotation.) The drop-down menu begins with the POS tags; the entity and pretagging tags are at the bottom by ASCII sort order.

  • Search Text: (Optional) Return all strings including the specified text. If this field is left blank, the query will return all annotations with the specified label(s) in the specified file set(s). Wildcards are not permitted and the search is not case-sensitive.

  • Results per Page: All, 10, 25, 50, 100, 200, 500, 1000

  • Concordance: If Search Text is specified and Concordance View is checked, the returned strings will be displayed in a monospace font with the Search Text underlined and centered with up to Context width characters of the annotation* on each side.

    For example, searching oncology release 12-04 for Annotation Type malignancy-developmental-state with Search Text "age" produces 14 hits, aligned with enough space to accommodate the longest left and right contexts, which in this case are both less than 35 characters: "infants less than 1 year of age" and "age of 6 months". Changing Context width to 10 narrows the display, truncating these strings to "1 year of age" and "age of 6 mont".

    The Export to Plain Text option puts tab characters around the Search Text and does not center it properly. In addition, this display preserves the newlines of the source text, which can split a single hit over two or more lines.

    *Note: The context is limited to the annotation with the specified tag, not the surrounding text. If you ask for a concordance view of all instances of the AFX (affix) tag with the text of "pre" in both domains, you will get 42 mentions of "pre" with no context, because there are no AFX mentions that include "pre" as a proper substring. A concordance view of AFX "pr" produces all of those along with 1 "supra" and many instances of "proto", all aligned on "pr".
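
The Search Text and Concordance behavior described above can be approximated with a short Python sketch. This is an illustration only, not the database's actual implementation: the function name `concordance` and its parameters are hypothetical, only the first occurrence of the Search Text in each annotation is aligned (an assumption), and the underlining of the Search Text is not reproduced.

```python
def concordance(annotations, search_text, context_width=35):
    """Return aligned concordance lines for annotations containing search_text.

    Hypothetical sketch: matching is a case-insensitive substring test with
    no wildcards, and the context is cut from the annotation string itself,
    never from the surrounding source text.
    """
    needle = search_text.lower()
    hits = []
    for text in annotations:
        # Assumes lowercasing does not change string length (true for ASCII).
        pos = text.lower().find(needle)
        if pos < 0:
            continue  # annotation does not contain the Search Text
        end = pos + len(needle)
        left = text[max(0, pos - context_width):pos]   # up to context_width chars
        right = text[end:end + context_width]          # up to context_width chars
        hits.append((left, text[pos:end], right))
    if not hits:
        return []
    # Pad every left context to the longest one so the matches line up.
    left_w = max(len(left) for left, _, _ in hits)
    return [f"{left:>{left_w}}{match}{right}" for left, match, right in hits]


if __name__ == "__main__":
    sample = ["infants less than 1 year of age", "age of 6 months"]
    for line in concordance(sample, "age", context_width=10):
        print(line)
```

With a context width of 10, the two sample strings are truncated to "1 year of age" and "age of 6 mont", as in the example above. An annotation whose entire text equals the Search Text, such as an AFX mention of "pre", yields a line with no context on either side.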