A corpus of about 50000 news articles in Danish, with full metadata, published on TV2 Regionerne between 2010 and 2020. Stored as multiple JSON files, one per article.
Licensed CC-BY. You must reference this Figshare entry if you use this data:
Leon Derczynski and Claus Ladefoged, "TV2 Regionerne News Corpus" (2020). doi:10.6084/m9.figshare.12382610
See articles/README.md, articles/LICENSE.md and articles/DATASTATEMENT.md.