figshare
Browse
protein_BioASQ_average_dedup_vector.txt (3.54 GB)

BioASQvec Plus.txt

Download (3.54 GB) This item is shared privately
dataset
modified on 2019-04-12, 02:13
BioASQvec Plus is an extended version of BioASQvec(http://bioasq.org/news/bioasq-releases-continuous-space-word-vectors-obtained-applying-word2vec-pubmed-abstracts
) taking the advantage of protein alias corpus retrieved from biological databases and biomedical publications. Not only does it contains a bigger corpus of bio-entity names, but also can assign an equal representation to different names that correspond to the same entity. BioASQvec Plus is a generic word embeddings which could be applied to different biomedical text mining models for improving word representations.

Funding

This work has been supported by The National Key Research and Development Program of China (No. 2018YFC0910404); National Natural Science Foundation of China (Grant NO: 61772409).