A corpus of 85 books, five for each of 17 different
European languages; some of them were taken from www.gutenberg.org. For each book
we also include the linguistic family of its language. The texts were chosen for no other reason
than to avoid, as much as possible, repetitive texts such as poetry.