figshare
Browse
OpenRefineInstitutionMatching.pptx (100.96 MB)

Institutions in OpenRefine: Cleaning up messes

Download (100.96 MB)
Version 2 2018-01-24, 14:10
Version 1 2018-01-23, 17:53
presentation
posted on 2018-01-24, 14:10 authored by Arthur SmithArthur Smith

OpenRefine (formerly Google Refine) is a tool for cleaning up messy data, but it is also well-suited to cross-linking different identifiers through their metadata entries. I will share some experiences in using OpenRefine to interlink GRID, ISNI, Wikidata, and our own internal institution identifiers. This matching process also highlights metadata errors (including duplicates and incorrect merges) in the different sources, and can help improve data quality on all sides. (Updated to full version with videos)

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC