LISC 2013 - Results: Discussion Groups on Semantic Web and Reproducibility
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
Results of discussion groups at the Linked Science Workshop 2013 held at the International Semantic Web Conference. (http://linkedscience.org/events/lisc2013/)
Participants were asked to develop a matrices about how semantic web/linked data solutions can help address reproducbility/re* problems. The results are documented in the spreadsheets and videos above.
The participants also developed a set of challenges for the Linked Science and broader semantic web community to help address these re* problems. See below or (lisc2013-challenges.txt)
Linked Science Community Challenges
The Linked Science 2013 workshop discussion participants identified several challenges to the Linked Data/Semantic Web community in order to help reproducibility (and other re* problems i.e. repurposing, reuse, etc) in science.
1) Promote the basics of linked data for reproducibility
Many basic linked data technologies (e.g. content negotiation or the use of dereferenceable URLs) could be usable for scientific reproducibility and reproducibility. The goal here would be to develop a set of how-to documents that guide e-scientists on how to use these technologies to support scientific re* problems. An important point would be to tie these solutions directly to domain scientist problems.
2) Integrate Semantic Web technologies and the publishing process.
Publishing is central to the scientific process and the issues of reusing scientific work. Semantic Web technologies should be integrated into the publishing process to enable reuse.
3) Make it easier to publish data and then work with it than work directly on your own data.
Publishing data should enable a scientist to do more. Can we make it so that publishing data is so useful to the scientist themselves that it would be their first option?
4) Provide an integrated view of the how, what, when, where, and why of the scientific process.
Linked data technologies are designed for integration and aggregation. Can we use these technologies to provide an integrated view over all the questions one might have with respect to a scientific experiment?
5) Provide a mechanisms for dealing with copyright on data both from a technical and social perspective.
Dealing with copyright is not always straightforward. Can we eliminate the barriers to reuse through helping scientists with these copyright issues in an automatic fashion.
6) Get an altmetric based award into one of our own venues.
Part of supporting re* problems is promoting sharing. We should "eat are own dogfood" by promoting and rewarding sharing in the major semantic web venues. We suggest an award based on some sort of altmetric.
7) Make sure the EBI RDF platform does not get shut down in two years.
The European Bioinformatics Institute has released RDF versions with SPARQL endpoints for many of their core data sets. They are making it available for two years and checking on whether it is used to determine if it continues in the long term. This is a key data resource for using Linked Data for reproducibility - let's make sure it keeps going.