figshare
Browse
.RAR
VTKEL_dataset_of_30K documents.rar (42.69 MB)
TEXT
VTKEL_dataset_31K_documents_revised.ttl (722.87 MB)
1/0
2 files

VTKEL: 30K documents dataset for Visual-Textual-Knowledge Entity Linking

Version 6 2020-09-09, 14:15
Version 5 2019-08-28, 10:12
Version 4 2019-04-10, 11:06
Version 3 2019-04-01, 14:23
Version 2 2019-04-01, 14:18
Version 1 2019-04-01, 13:36
dataset
posted on 2020-09-09, 14:15 authored by Shahi DostShahi Dost, Luciano Serafini, Marco Rospocher, Lamberto Ballan, Alessandro Sperduti

(Updated version, after fixed some bugs)

VTKL dataset, contains documents composed of pictures with five corresponding textual captions for each image. The VTKL dataset is obtained by extending the Flikr30k dataset, designed for visual-textual mention alignment, with links to YAGO ontolgy, one of the largest web knowledge base. These links are obtained automatically by processing each image caption with PIKES, an NLP tool for entity recognition and linking.

Funding

Fondazione Bruno Kessler, Italy

History