Datasets.zip (2.67 MB)
Datasets in Spanish for short/long slots and short/long sentences
posted on 2022-03-15, 14:19 authored by Surya Roca, Sophie Rosset, José García, Álvaro Alesanco
Datasets generated in Spanish for medication management scenarios.
The collection consists of four datasets:
- Short slots and short sentences
- Short slots and long sentences
- Long slots and short sentences
- Long slots and long sentences
Each dataset includes training and development data.
The files are the following:
- data: The sentence, the slot tags, and the intent (separated by tabs).
- label: The sentences' intents.
- seq.in: The sentences.
- seq.out: The slot tags (following the IOB format).
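As a rough illustration of how the three aligned files might be read together, the Python sketch below loads one split and groups IOB tags into slot values. It rests on assumptions not stated in the description: that seq.in, seq.out, and label hold one example per line in the same order, that tokens and tags are separated by whitespace, and that tags use the usual B-/I-/O prefixes. The function names, the directory argument, and the slot names in the usage note are hypothetical.

```python
from pathlib import Path


def read_lines(path):
    """Return the lines of a UTF-8 text file without trailing newlines."""
    return Path(path).read_text(encoding="utf-8").splitlines()


def load_split(split_dir):
    """Load (tokens, slot_tags, intent) triples from one split directory.

    Assumption: seq.in, seq.out and label are line-aligned, one example per line.
    """
    split_dir = Path(split_dir)
    sentences = read_lines(split_dir / "seq.in")
    tag_rows = read_lines(split_dir / "seq.out")
    intents = read_lines(split_dir / "label")
    if not (len(sentences) == len(tag_rows) == len(intents)):
        raise ValueError("seq.in, seq.out and label have different line counts")

    examples = []
    for line_no, (sentence, tags, intent) in enumerate(
        zip(sentences, tag_rows, intents), start=1
    ):
        tokens, slot_tags = sentence.split(), tags.split()
        if len(tokens) != len(slot_tags):
            raise ValueError(f"line {line_no}: token and tag counts differ")
        examples.append((tokens, slot_tags, intent.strip()))
    return examples


def iob_to_spans(tokens, tags):
    """Group IOB-tagged tokens into (slot_name, slot_text) pairs."""
    spans = []
    current_type, current_tokens = None, []
    for token, tag in zip(tokens, tags):
        starts_new = tag.startswith("B-") or (
            tag.startswith("I-") and tag[2:] != current_type
        )
        if starts_new:
            # Close any open span, then start a new one (also covers a stray I- tag).
            if current_tokens:
                spans.append((current_type, " ".join(current_tokens)))
            current_type, current_tokens = tag[2:], [token]
        elif tag.startswith("I-"):
            # Continuation of the currently open span.
            current_tokens.append(token)
        else:
            # "O" (or any other tag) closes the open span.
            if current_tokens:
                spans.append((current_type, " ".join(current_tokens)))
            current_type, current_tokens = None, []
    if current_tokens:
        spans.append((current_type, " ".join(current_tokens)))
    return spans
```

For example, with made-up slot names, iob_to_spans(["tomo", "ibuprofeno", "cada", "ocho", "horas"], ["O", "B-drug", "O", "B-freq", "I-freq"]) yields one "drug" slot with the value "ibuprofeno" and one "freq" slot with the value "ocho horas". The actual slot and intent labels should be taken from the files themselves.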