VTKEL: 30K documents dataset for Visual-Textual-Knowledge Entity Linking
dataset
Posted on 2020-09-09, 14:15. Authored by Shahi Dost, Luciano Serafini, Marco Rospocher, Lamberto Ballan, Alessandro Sperduti
(Updated version, after fixing some bugs.)
The VTKEL dataset contains documents, each composed of a picture with five corresponding textual captions. It is obtained by extending the Flickr30k dataset, designed for visual-textual mention alignment, with links to the YAGO ontology, one of the largest web knowledge bases. These links are obtained automatically by processing each image caption with PIKES, an NLP tool for entity recognition and linking.
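To make the document structure described above concrete, here is a minimal in-memory sketch of one picture, its five captions, and the entity links attached to caption mentions. It does not reflect the dataset's actual file format, which this page does not specify. The class names, field names, and example identifiers are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class EntityLink:
    """One caption mention linked to a knowledge-base class (hypothetical layout)."""
    mention: str        # surface text in the caption
    caption_index: int  # which of the five captions (0-4) contains the mention
    yago_class: str     # illustrative identifier, not copied from the dataset


@dataclass
class Document:
    """One picture with its five captions and entity links (hypothetical layout)."""
    image_id: str
    captions: list[str]
    links: list[EntityLink] = field(default_factory=list)

    def __post_init__(self) -> None:
        # The description states five captions per image; enforce that here.
        if len(self.captions) != 5:
            raise ValueError("expected five captions per image")

    def links_for_caption(self, index: int) -> list[EntityLink]:
        """Return the entity links whose mention occurs in the given caption."""
        return [link for link in self.links if link.caption_index == index]


# Illustrative values only; not taken from the dataset.
example = Document(
    image_id="example-image",
    captions=[f"caption {i}" for i in range(5)],
    links=[EntityLink("dog", 0, "example:Dog"), EntityLink("ball", 2, "example:Ball")],
)
```

Calling `example.links_for_caption(0)` returns only the link for the mention "dog", since that is the one link attached to the first caption.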
Funding
Fondazione Bruno Kessler, Italy
Categories and keywords
Knowledge Representation
Computer vision; Image processing; Manufacturing systems; Defect detection; Hot rolling; Rail; Profile
Natural Language Processing, Word Embeddings
Entity Recognition
entity linking
Multimedia Programming
Natural Language Processing
Artificial Intelligence and Image Processing
Computer Vision