figshare
Browse
Data enhancing the Royal Society of Chemistry publication archive.ppt (2.07 MB)

Data enhancing the Royal Society of Chemistry publication archive

Download (0 kB)
presentation
posted on 2014-03-22, 03:41 authored by Antony WilliamsAntony Williams, Colin Batchelor, Peter Corbett, Ken Karapetyan, Valery TkachenkoValery Tkachenko

The Royal Society of Chemistry has an archive of hundreds of thousands of published articles containing various types of chemistry related data – compounds, reactions, property data, spectral data etc. RSC has a vision of extracting as much of these data as possible and providing access via ChemSpider and its related projects. To this end we have applied a combination of text-mining extraction, image conversion and chemical validation and standardization approaches. The outcome of this project will result in new chemistry related data being added to our chemical and reaction databases and in the ability to more tightly couple web-based versions of the articles with these extracted data. The ability to search across the archive will be enhanced as a result. This presentation will report on our progress in this data extraction project and discuss how we will ultimately use similar approaches in our publishing pipeline to enhance article markup for new publications.

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC