figshare
Browse
1/1
2 files

New WhiteText Corpus

dataset
posted on 2015-05-01, 19:26 authored by Leon FrenchLeon French, Po Liu, Olivia Marais, Tianna Koreman, Lucia Tseng, Artemis Lai, Paul Pavlidis


The WhiteText project aims to extract neuroanatomical information from the biomedical literature. We are specifically focused on the extraction of brain regions and connections between them. The corpus from our most recent article are provided above (descriptions below). 

More information at http://www.chibi.ubc.ca/whitetext

WhiteTextUnseenEval.xml: AirolaXML file for the new evaluated corpus that was extracted from the large classification run on the Journal of Comparative Neurology abstracts.

WhiteTextMerged2013.xml: Merged AirolaXML file that has both the old and new manually annotated corpus of connectivity statements.

History