Metadata harvesting through schema.org

posted on 27.07.2019 by Dave Vieglais, Matthew Jones
Presented for discussion at 2019 ESIP Summer Meeting, 2019-07-17.

Repositories have recognized the benefits of adopting schema.org metadata in their data catalog landing pages to improve discoverability, particularly with the incentive of inclusion in Google Dataset Search. While Google supports broad, general search and discovery, the same mechanism can also improve domain-specific aggregated search systems such as DataONE. This working session focuses on real-world issues of implementing schema.org for repositories, how to link traditional metadata records into dataset landing pages, and how doing so can improve harvesting and representation by science-focused aggregators such as DataONE. We will work through recommendations emerging from science-on-schema.org, optimizing JSON-LD to work with major search engines, and options for extending the markup to include more detailed dataset information beyond the typical discovery-level metadata found in most records.
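
To make the harvesting idea concrete, the following is a minimal sketch of how an aggregator might pull schema.org JSON-LD out of a landing page and follow a link to the full metadata record. It is an illustration under stated assumptions, not the DataONE implementation: the sample page, the example.org URLs, the function names, and the use of subjectOf with contentUrl to point at the traditional metadata record are all assumptions made for this example.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Collects the parsed contents of <script type="application/ld+json"> blocks."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                self.blocks.append(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # skip malformed blocks rather than abort the harvest


def find_datasets(html):
    """Return every JSON-LD node in the page whose @type includes Dataset."""
    parser = JsonLdExtractor()
    parser.feed(html)
    parser.close()
    datasets = []
    for block in parser.blocks:
        for node in block if isinstance(block, list) else [block]:
            if not isinstance(node, dict):
                continue
            # A block may wrap its nodes in @graph; otherwise use the node itself.
            for candidate in node.get("@graph", [node]):
                if not isinstance(candidate, dict):
                    continue
                types = candidate.get("@type", [])
                if isinstance(types, str):
                    types = [types]
                if "Dataset" in types:
                    datasets.append(candidate)
    return datasets


def metadata_links(dataset):
    """Assumed convention: subjectOf entries carry a contentUrl to the full record."""
    subject = dataset.get("subjectOf", [])
    if isinstance(subject, dict):
        subject = [subject]
    return [s["contentUrl"] for s in subject
            if isinstance(s, dict) and "contentUrl" in s]


# Hypothetical landing page used only for this illustration.
SAMPLE_PAGE = """
<html><head>
<script type="application/ld+json">
{
  "@context": "https://schema.org/",
  "@type": "Dataset",
  "@id": "https://example.org/dataset/abc123",
  "name": "Example dataset",
  "description": "A placeholder record for illustration.",
  "subjectOf": {
    "@type": "DataDownload",
    "encodingFormat": "application/xml",
    "contentUrl": "https://example.org/metadata/abc123.xml"
  }
}
</script>
</head><body></body></html>
"""

if __name__ == "__main__":
    for ds in find_datasets(SAMPLE_PAGE):
        print(ds.get("name"), metadata_links(ds))
```

A harvester built along these lines would index the discovery-level fields from the JSON-LD directly and fetch the linked record when it needs the more detailed, domain-specific metadata.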


DataONE (Data Observation Network for Earth)

Directorate for Computer & Information Science & Engineering

