Email Contact: bioie@ldc.upenn.edu
This material is based upon work supported by the National Science Foundation under Grant No.: EIA-0205448
Home
      -Overview
      -About Us/Credits
Resources
      -Data
      -Publications
      -Works of Interest
Documentation
      -User Guide
      -Guided Tour
      -Description of Data
      -Software/Tools
      -Docs for Annotators
Software/Tools
      -WordFreak
      -LAW Workflow System
      -Annotation Database
      -Auto Text Processing
      -TreeEditor
      -Taggers
Archive Releases
 
Printer Friendly Version
 
Mining the Bibliome
Description of Data
 
This page provides a navigation and description through the "Data" section, the current release to this project.

Contents

  1. Data
  2. Documentation



Data

The main page contains the link to the data section:

Once the term "Data" is clicked, the data section contains links to the CYP450, Oncology, and Tools sections of each project:

CYP450

Once the term "CYP450" is clicked, the CYP450 section contains links to the Entity and Treebank sections belonging to the CYP450 domain:

Entity

Once the term "Entity" is clicked, the Entity section contains links to the HTML View, WordFreak Files, and Extracted ML Data sections belonging to the CYP450 domain:

HTML View

Once the term "HTML View" is clicked, the HTML View section contains a list of CYP450 files that have been annotated:

This CYP450 HTML View link describes how to view these annotated files through its legends, display controls, and a direct link to where the article came from the PubMed site.

WordFreak Files

Under the WordFreak Files section, the term "[get the archive]" may be clicked to download raw and annotated CYP450 files. These files may be seen using WordFreak.

Extracted ML Data

Under the Extracted ML Data section, the term "[statistics]" may be clicked to see the total amount of CYP450 files, along with the number of annotations, relations, and chains that may be contained in these files. Those with permissible server rights may click on the terms "Extracted ML Data" to get extracted CYP450 anntation information in XML format.

Treebank

Under the Treebank section, the term "[get the archive]" may be clicked to download Treebank annotation information in a special ".mrg" format for files in the CYP450 domain.

Oncology

Once the term "Oncology" is clicked, the Oncology section contains links to the Entity and Treebank sections belonging to the Oncology domain:

Entity

Once the term "Entity" is clicked, the Entity section contains links to the HTML View, WordFreak Files, and Extracted ML Data sections belonging to the Oncology domain:

HTML View

Once the term "HTML View" is clicked, the HTML View section contains a list of Oncology files that have been annotated:

This Oncology HTML View link describes how to view these annotated files through its legends, display controls, and a direct link to where the article came from the PubMed site.

WordFreak Files

Under the WordFreak Files section, the term "[get the archive]" may be clicked to download raw and annotated Oncology files. These files may be seen using WordFreak.

Extracted ML Data

Under the Extracted ML Data section, the term "[statistics]" may be clicked to see the total amount of Oncology files, along with the number of annotations, relations, and chains that may be contained in these files. Those with permissible server rights may click on the terms "Extracted ML Data" to get extracted Oncology anntation information in XML format.

Treebank

Under the Treebank section, the term "[get the archive]" may be clicked to download Treebank annotation information in a special ".mrg" format for files in the Oncology domain.

Tools

Once the term "Tools" is clicked, the Software/Tools page of the BioIE website is brought up.

Documentation

The main page contains the link to the documentation section:

Once the term "Documentation" is clicked, it opens up a documentation page which briefly describes the "Mining the Bibliome" project.