neVANiLLa dataset

Gashkov, Alexander

neVANiLLa dataset

Posted on 2021-01-09 - 16:20 authored by Alexander Gashkov

An extended version of the VANiLLa dataset. It includes negative pairs, e.g. a question and a wrong answer. The Dataset₁ is created by mixing questions and answers from records with different question relation/entity label combination. The Dataset₂ is created from records with same question relation and different entity label, so the Dataset₂ is "harder" to predict. Both datasets are splitted into train and test parts.

Article: A Both, A Gashkov, M Eltsova -- Similarity Detection of Natural-Language Questions and Answers using the VANiLLa dataset.

CITE THIS COLLECTION

DataCite

3 Biotech

3D Printing in Medicine

3D Research

3D-Printed Materials and Systems

4OR

AAPG Bulletin

AAPS Open

AAPS PharmSciTech

Abhandlungen aus dem Mathematischen Seminar der Universität Hamburg

ABI Technik (German)

Academic Medicine

Academic Pediatrics

Academic Psychiatry

Academic Questions

Academy of Management Discoveries

Academy of Management Journal

Academy of Management Learning and Education

Academy of Management Perspectives

Academy of Management Proceedings

Academy of Management Review

Gashkov, Alexander (2021). neVANiLLa dataset. figshare. Collection. https://doi.org/10.6084/m9.figshare.c.5263142.v1

https://doi.org/10.6084/m9.figshare.c.5263142.v1

or

Select your citation style and then place your mouse over the citation text to select it.

REFERENCES

https://figshare.com/articles/dataset/Vanilla_dataset/12360743

SHARE

email

Usage metrics

AUTHORS (1)

AG

Alexander Gashkov

KEYWORDS

question answering Natural language processing Computational Linguistics Natural Language Processing

Search Collections

need help?

neVANiLLa dataset

CITE THIS COLLECTION

REFERENCES

SHARE

Usage metrics

AUTHORS (1)

CATEGORIES

KEYWORDS