Version 6 2025-10-23, 20:15
Version 5 2025-10-23, 20:14
Version 4 2025-08-10, 13:53
Version 3 2025-08-10, 13:45
Version 2 2025-03-26, 15:03
Version 1 2025-03-25, 21:32
software
Posted on 2025-10-23, 20:15, authored by Giuliana Fiorentino, Vittorio Ganfi, Marco Russodivito, Alessandro Cioffi, Maria Ausilia Simonelli
<p dir="ltr">The simplification of language, particularly in administrative discourse, has long been a central concern of Italian linguistics. Over the past few decades, significant progress has been made, including the development of consolidated and widely accepted lists of linguistic features, both morphosyntactic and lexical, that influence textual simplicity and accessibility (cf. Fiorentino/Ganfi 2024). These advances contributed to the early creation of a readability index, the <i>Gulpease index</i>, in the 1980s (cf. Lucisano/Piemontese 1988). Within this framework, the authors have developed <i>SEMPL-IT</i>, a software tool for the automatic simplification of administrative texts, supported by a large language model (LLM) (cf. Russodivito et al. 2024; Fiorentino/Russodivito 2025; Ganfi/Russodivito 2025; Fiorentino et al. in press; Fiorentino/Russodivito in press). As part of this project, a corpus named <i>ItaIst</i> (Fiorentino et al. 2024b) was compiled and automatically simplified using the <i>BASIC approach</i>, resulting in a parallel corpus of simplified texts. To validate the effectiveness of the simplification process, this simplified corpus was compared to the source corpus and evaluated in terms of improved readability and <i>semantic similarity</i> (cf. Chandrasekaran et al. 2021). In this contribution, we introduce and validate a new methodology, the <i>CHAIN approach</i>, applied to a different corpus, <i>ItaRegol</i> (Fiorentino et al. 2024a). <i>ItaRegol</i> is smaller than <i>ItaIst</i> and comprises rules and regulations, i.e., legally binding texts that create, modify, or extinguish subjective legal positions. Because of the legal nature of these texts, simplification must be carried out with caution to avoid altering their legal effects.
This paper compares the two simplification approaches, <i>BASIC</i> and <i>CHAIN</i>, by evaluating the parameters adopted, assessing the quality of the simplified output, and drawing conclusions about how the two strategies differ in improving the readability of administrative versus regulatory texts.</p>
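<p dir="ltr">To make the readability comparison mentioned above concrete, the following is a minimal sketch of the standard Gulpease formula, 89 + (300 × sentences − 10 × letters) / words. It is not taken from SEMPL-IT: the function name <code>gulpease</code>, the regex-based tokenization, the sentence-splitting rule, and the two example sentences are all assumptions made for illustration, and the project's own tooling may count words and sentences differently.</p>

```python
import re


def gulpease(text: str) -> float:
    """Return the Gulpease readability score of `text` (higher = easier).

    Hypothetical helper for illustration only. Tokenization is naive:
    a word is any run of Unicode letters, so "l'istanza" counts as two
    words, and a sentence is any non-empty chunk between '.', '!' or '?'.
    """
    # Runs of letters only (no digits, no underscore, no punctuation).
    words = re.findall(r"[^\W\d_]+", text)
    if not words:
        raise ValueError("text contains no words")

    letters = sum(len(word) for word in words)

    # Split on sentence-ending punctuation and drop empty fragments.
    sentences = [chunk for chunk in re.split(r"[.!?]+", text) if chunk.strip()]
    n_sentences = max(len(sentences), 1)

    # The score is not clamped here; very short, simple texts can exceed 100.
    return 89 + (300 * n_sentences - 10 * letters) / len(words)


# Invented example pair (not from ItaIst or ItaRegol).
original = (
    "Si comunica che l'istanza dovrà essere presentata "
    "entro il termine perentorio di trenta giorni."
)
simplified = "Presenta la domanda entro trenta giorni."

print(f"original:   {gulpease(original):.1f}")
print(f"simplified: {gulpease(simplified):.1f}")
```

<p dir="ltr">Under these assumptions the shorter rewrite scores higher than the bureaucratic original. A readability gain of this kind says nothing about whether the meaning was preserved, which is why the evaluation pairs readability with a semantic similarity measure.</p>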