
Datasets in Spanish for short/long slots and short/long sentences

Posted on 2022-03-15, 14:19. Authored by Surya Roca, Sophie Rosset, José García, Álvaro Alesanco
Datasets generated in Spanish for medication management scenarios.
The collection consists of four datasets:
- Short slots and short sentences
- Short slots and long sentences
- Long slots and short sentences
- Long slots and long sentences

Each dataset includes training and development data.

The files are as follows:
- data: The sentence, the slot tags, and the intent (separated by tabs).
- label: The sentences' intents.
- The sentences.
- seq.out: The slot tags (following the IOB format).
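
As a rough illustration of the file layout described above, here is a minimal Python sketch for reading one line of the data file and grouping IOB tags into slot values. It assumes that the three tab-separated fields appear in the order listed (sentence, slot tags, intent) and that tokens and tags are separated by single spaces; neither detail is confirmed here. The function names, slot labels, and intent name in the example are hypothetical and are not taken from the datasets.

```python
from typing import List, Tuple


def parse_data_line(line: str) -> Tuple[List[str], List[str], str]:
    """Split one line into tokens, slot tags, and intent.

    Assumption: fields are ordered sentence <TAB> slot tags <TAB> intent,
    and tokens/tags within a field are whitespace-separated.
    """
    sentence, tags, intent = line.rstrip("\n").split("\t")
    tokens = sentence.split()
    slot_tags = tags.split()
    if len(tokens) != len(slot_tags):
        raise ValueError("token count and tag count differ")
    return tokens, slot_tags, intent


def iob_spans(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group IOB tags into (slot label, slot text) pairs.

    A "B-x" tag opens a slot, a following "I-x" tag with the same label
    extends it, and anything else (including "O") closes the open slot.
    """
    spans: List[Tuple[str, str]] = []
    label = None
    words: List[str] = []
    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            if label is not None:
                spans.append((label, " ".join(words)))
            label, words = tag[2:], [token]
        elif tag.startswith("I-") and label == tag[2:]:
            words.append(token)
        else:
            if label is not None:
                spans.append((label, " ".join(words)))
            label, words = None, []
    if label is not None:
        spans.append((label, " ".join(words)))
    return spans


if __name__ == "__main__":
    # Hypothetical example line; the labels "drug", "dose", and
    # "add_reminder" are invented for illustration only.
    example = "tomar ibuprofeno 600 mg\tO B-drug B-dose I-dose\tadd_reminder"
    tokens, tags, intent = parse_data_line(example)
    print(intent, iob_spans(tokens, tags))
```

Checking that the token and tag counts match is a cheap way to catch misaligned lines before training a slot-filling model on them.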


Ministerio de Economía, Industria y Competitividad of the Gobierno de España and the European Regional Development Fund (TIN2016-76770-R and BES-2017-082017)

The Gobierno de Aragón (Reference Group T31_20R)

FEDER 2014-2020 "Construyendo Europa desde Aragón"