figshare
Browse
numdb_0105.zip (51.96 MB)

NumDB-dataset

Download (51.96 MB)
Version 4 2018-05-01, 18:03
Version 3 2018-05-01, 18:03
Version 2 2018-05-01, 18:02
Version 1 2018-05-01, 17:21
dataset
posted on 2018-05-01, 18:03 authored by Alessandro PiscopoAlessandro Piscopo, Emilia KacprzakEmilia Kacprzak
NumDB benchmark: set of tables originally extracted from DBpedia, from which different value samples have been selected and various degrees of errors have been added in order to simulate actual tables on the Web.
The dataset has been created for
Kacprzak, E., Giménez-García, J. M., Piscopo, A., Koesten, L., Ibáñez, L. D., Tennison, J., & Simperl, E. (2018, November). Making Sense of Numerical Data-Semantic Labelling of Web Tables. In European Knowledge Acquisition Workshop (pp. 163-178). Springer, Cham.
A description of the data generation process is in the paper.

Funding

Marie Skłodowska-Curie grant agreement No. 642795 (WDAqua ITN)

History