numdb_0105.zip (51.96 MB)
NumDB-dataset
Version 4 2018-05-01, 18:03
Version 3 2018-05-01, 18:03
Version 2 2018-05-01, 18:02
Version 1 2018-05-01, 17:21
dataset
posted on 2018-05-01, 18:03 authored by Alessandro Piscopo, Emilia Kacprzak
NumDB benchmark: a set of tables originally extracted from DBpedia, from which different value samples have been selected and to which various degrees of error have been added in order to simulate actual tables on the Web.
The dataset has been created for:
Kacprzak, E., Giménez-García, J. M., Piscopo, A., Koesten, L., Ibáñez, L. D., Tennison, J., & Simperl, E. (2018, November). Making Sense of Numerical Data - Semantic Labelling of Web Tables. In European Knowledge Acquisition Workshop (pp. 163-178). Springer, Cham.
A description of the data generation process is in the paper.