Version 6 2020-09-09, 14:15Version 6 2020-09-09, 14:15
Version 5 2019-08-28, 10:12Version 5 2019-08-28, 10:12
Version 4 2019-04-10, 11:06Version 4 2019-04-10, 11:06
Version 3 2019-04-01, 14:23Version 3 2019-04-01, 14:23
Version 2 2019-04-01, 14:18Version 2 2019-04-01, 14:18
Version 1 2019-04-01, 13:36Version 1 2019-04-01, 13:36
dataset
posted on 2020-09-09, 14:15authored byShahi DostShahi Dost, Luciano Serafini, Marco Rospocher, Lamberto Ballan, Alessandro Sperduti
(Updated version, after fixed some bugs)
VTKL dataset, contains documents composed of pictures with five corresponding textual captions for each image. The VTKL dataset is obtained by extending the Flikr30k dataset, designed for visual-textual mention alignment, with links to YAGO ontolgy, one of the largest web knowledge base. These links are obtained automatically by processing each image caption with PIKES, an NLP tool for entity recognition and linking.