Posted on 2022-03-15, 14:19. Authored by Surya Roca, Sophie Rosset, José García, Álvaro Alesanco
Datasets generated in Spanish for medication management scenarios.
The collection consists of four datasets:
- Short slots and short sentences
- Short slots and long sentences
- Long slots and short sentences
- Long slots and long sentences
Each dataset includes training and development data.
The files are as follows:
- data: The sentence, the slot tags, and the intent (separated by tabs).
- label: The sentences' intents.
- seq.in: The sentences.
- seq.out: The slot tags (following the IOB format).
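As an illustration of the file layout described above, the following minimal Python sketch parses one line of the data file and groups IOB tags into slot values. It assumes that each line holds one example, that the three tab-separated fields appear in the listed order (sentence, slot tags, intent), and that tokens and tags are whitespace-separated and aligned one to one. The function names, the example sentence, and the labels `drug`, `freq`, and `inform_medication` are hypothetical and are not taken from the datasets.

```python
from typing import List, Tuple


def parse_data_line(line: str) -> Tuple[List[str], List[str], str]:
    """Split one line of the data file into tokens, slot tags, and intent.

    Assumes the field order sentence <TAB> slot tags <TAB> intent.
    """
    sentence, tags, intent = line.rstrip("\n").split("\t")
    tokens = sentence.split()
    slot_tags = tags.split()
    if len(tokens) != len(slot_tags):
        raise ValueError("token and tag counts differ")
    return tokens, slot_tags, intent


def iob_to_spans(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group IOB tags into (slot label, slot text) pairs.

    An I- tag that does not continue an open slot is treated as outside.
    """
    spans: List[Tuple[str, str]] = []
    label = None
    words: List[str] = []
    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A new slot begins; close any slot that is still open.
            if label is not None:
                spans.append((label, " ".join(words)))
            label, words = tag[2:], [token]
        elif tag.startswith("I-") and label == tag[2:]:
            # Continuation of the currently open slot.
            words.append(token)
        else:
            # "O" or an unmatched I- tag closes the open slot, if any.
            if label is not None:
                spans.append((label, " ".join(words)))
            label, words = None, []
    if label is not None:
        spans.append((label, " ".join(words)))
    return spans


# Hypothetical example line; not taken from the datasets.
example = "tomo ibuprofeno cada ocho horas\tO B-drug O B-freq I-freq\tinform_medication"
tokens, tags, intent = parse_data_line(example)
slots = iob_to_spans(tokens, tags)
```

The same `iob_to_spans` helper could be applied to paired lines read from seq.in and seq.out, under the assumption that line i of one file corresponds to line i of the other.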
Funding
Ministerio de Economía, Industria y Competitividad of the Gobierno de España and the European Regional Development Fund (TIN2016-76770-R and BES-2017-082017)
The Gobierno de Aragón (Reference Group T31_20R)
FEDER 2014-2020 "Construyendo Europa desde Aragón"