This FREEBASE FB15k DATA consists of a collection of triplets (synset, relation_type, triplet) extracted from Freebase (http://www.freebase.com). This data set can be seen as a 3-mode tensor depicting ternary relationships between synsets.
All freebase_mtr100_mte100-*.txt files contain one triplet per line, with 2 mids (unique Freebase entity identifier) and relation type identifier in a tab separated format. The first element is the mid of the left hand side (of head) of the relation triple, the third one is the mid of the right hand side (or tail) and the second element is the name of the relationship between them.
There are 14,951 mids and 1,345 relation types among them. The training set contains 483,142 triplets, the validation set 50,000 and the test set 59,071.
All triplets are unique and we made sure that all synsets appearing in the validation or test sets were occurring in the training set.
The FB15k.zip file contains the other files, with more compression than the default "download all".