figshare
Browse

FB15k

Version 2 2020-02-20, 02:46
Version 1 2020-02-20, 02:39
dataset
posted on 2020-02-20, 02:46 authored by Huon WilsonHuon Wilson
This FREEBASE FB15k DATA consists of a collection of triplets (synset, relation_type, triplet) extracted from Freebase (http://www.freebase.com). This data set can be seen as a 3-mode tensor depicting ternary relationships between synsets.

All freebase_mtr100_mte100-*.txt files contain one triplet per line, with 2 mids (unique Freebase entity identifier) and relation type identifier in a tab separated format. The first element is the mid of the left hand side (of head) of the relation triple, the third one is the mid of the right hand side (or tail) and the second element is the name of the relationship between them.

There are 14,951 mids and 1,345 relation types among them. The training set contains 483,142 triplets, the validation set 50,000 and the test set 59,071.

All triplets are unique and we made sure that all synsets appearing in the validation or test sets were occurring in the training set.

The FB15k.zip file contains the other files, with more compression than the default "download all".

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC