Datasets in Spanish for short/long slots and short/long sentences
Dataset posted on 15.03.2022, 14:19, authored by Surya Roca, Sophie Rosset, José García, Álvaro Alesanco
Spanish-language datasets generated for medication management scenarios.
The collection consists of four datasets:
- Short slots and short sentences
- Short slots and long sentences
- Long slots and short sentences
- Long slots and long sentences
Each dataset includes training and development data.
The files are as follows:
- data: The sentence, the slot tags, and the intent (separated by tabs).
- label: The intent of each sentence.
- seq.in: The sentences.
- seq.out: The slot tags (following the IOB format).
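
The file layout above can be read with a few lines of Python. This is a minimal sketch, not an official loader: it assumes that seq.in, seq.out, and label are line-aligned (line i of each file describes the same sentence), that tokens and tags are separated by whitespace, and that slot tags use the usual `B-`/`I-`/`O` prefixes. The function names are made up for illustration.

```python
from pathlib import Path


def read_lines(path):
    """Return the non-empty lines of a UTF-8 text file."""
    with open(path, encoding="utf-8") as handle:
        return [line.rstrip("\n") for line in handle if line.strip()]


def load_split(folder):
    """Load one split as a list of (tokens, tags, intent) triples.

    Assumption: the three files are line-aligned and whitespace-tokenized.
    """
    folder = Path(folder)
    sentences = read_lines(folder / "seq.in")
    tag_lines = read_lines(folder / "seq.out")
    intents = read_lines(folder / "label")
    if not (len(sentences) == len(tag_lines) == len(intents)):
        raise ValueError("seq.in, seq.out and label differ in length")
    examples = []
    for sentence, tag_line, intent in zip(sentences, tag_lines, intents):
        tokens, tags = sentence.split(), tag_line.split()
        if len(tokens) != len(tags):
            raise ValueError(f"token/tag mismatch in: {sentence!r}")
        examples.append((tokens, tags, intent.strip()))
    return examples


def iob_spans(tokens, tags):
    """Group IOB tags into (slot_name, slot_text) pairs."""
    spans, name, words = [], None, []
    for token, tag in zip(tokens, tags):
        starts_slot = tag.startswith("B-") or (
            tag.startswith("I-") and tag[2:] != name
        )
        if starts_slot:
            # Close the slot in progress, then open a new one.
            if name is not None:
                spans.append((name, " ".join(words)))
            name, words = tag[2:], [token]
        elif tag.startswith("I-"):
            # Continuation of the current slot.
            words.append(token)
        else:
            # Any other tag (normally "O") ends the current slot.
            if name is not None:
                spans.append((name, " ".join(words)))
            name, words = None, []
    if name is not None:
        spans.append((name, " ".join(words)))
    return spans
```

For example, `load_split("short_slots_short_sentences/train")` would return the training triples if the archive were unpacked under that (hypothetical) folder name, and `iob_spans(tokens, tags)` turns one triple's tags into readable slot values. The length checks are deliberate: a mismatch between tokens and tags usually means the tokenization assumption does not hold for that file.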