figshare
Browse
RASH_evaluation.zip (198.54 kB)

Outcomes of SAVE-SD 2015 and 2016 questionnaires on RASH and analysis of RDF annotations in the RASH papers.

Download (224.6 kB)
Version 5 2017-07-06, 15:30
Version 4 2016-10-04, 15:34
Version 3 2016-10-04, 15:33
Version 2 2016-10-04, 14:27
Version 1 2016-10-04, 14:20
dataset
posted on 2017-07-06, 15:30 authored by Francesco OsborneFrancesco Osborne, Silvio PeroniSilvio Peroni
This dataset contains all the source materials and the data collected for the evaluation of RASH (Research Articles in Simplified HTML), which is presented in the paper:

Peroni, S., Osborne, F., Di Iorio, A., Nuzzolese, A., Poggi, F., Vitali, F., Motta, E. (2017). Research Articles in Simplified HTML: a Web-first format for HTML-based scholarly articles. https://w3id.org/people/essepuntato/papers/rash-peerj2016.html

submitted to the PeerJ Computer Science.


In particular this archive contains seven items:
- the file "README.txt" (this file);
- the directory "save-sd2015", containing the source of six RASH articles submitted to SAVE-SD 2015 and their related RDF statements extracted from them and stored as Turtle files;
- the directory "save-sd2016", containing the source of five RASH articles submitted to SAVE-SD 2015 and their related RDF statements extracted from them and stored as Turtle files;
- the directory "stats", containing CSVf files with data about the RDF statements extracted from the RASH papers presented in the two edition of SAVE-SD.
- the directory "script" containing the Python scripts creating the CSV files;
- the file "script.sh" that runs the computation for creating all the statistics stored in the directory "stats";
- the directory "questionnaires", containing four CSV files reporting the questionnaires filled in by authors and reviewers of RASH papers published in the SAVE-SD 2015 and SAVE-SD 2016 workshops.

In particular, the "stats" directory contains two directories describing the data about the statements of the RASH papers of SAVE-SD 2015 ("2015") and of SAVE-SD 2016 ("2016"). A summary of all these data is provided in the directory "tot".

Each of these directories contains four distinct CSV files:
- "stats_short.csv" contains all the numeric data related to the vocabularies used in the statements and the way they have been involved in the RASH papers;
- "stats.csv" extends the previous CSV file by adding also all the information related to each of the entities involved per vocabulary;
- "stats_perc.csv" contains the percentages of the vocabularies used in the statements and the way they have been involved in the RASH papers;
- "stats_short_perc.csv" extends the previous CSV file by adding also all the information related to each of the entities involved per vocabulary.

In the CSV tables, the last columns are dedicated to some metrics calculated starting from the values specified for each vocabulary/entity involved in the papers. In particular:
- "TOTAL" is the sum of all the statements indicated in a row;
- "mean" is the arithmetic mean of all the statements indicated in a row;
- "std" is the standard deviation of related to the arithmetic mean;
- "sqrt" is the sum of all the square root values of all the statements indicated in a row;
- "log" is the sum of all the natural logarithm values of all the statements indicated in a row.


For any question about the data please contact francesco.osborne@open.ac.uk or silvio.peroni@unibo.it

History