figshare
Browse
matscholar_entities.json (1.43 GB)

Entities database

Download (1.43 GB)
dataset
posted on 2019-06-04, 17:47 authored by Leigh WestonLeigh Weston
This json file contains the named entities extracted from 3.27 million materials science abstracts.

Each document is indexed by it's digital object identifier (DOI) which may be used to find the original article. Each document contains the following entity types: material (MAT), sample descriptor (DSC), symmetry/phase label (SPL), property (PRO), application (APL), synthesis method (SMT), and characterization method (CMT).

The named entity recognition was optimized for inorganic materials science.

Funding

This work was supported by Toyota Research Institute through the Accelerated Materials Design and Discovery program.

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC