3 files

articles-big_100p-300d-50m-0.emb

journal contribution
posted on 2022-01-23, 19:05 authored by Martin Canaan Mafunda, Maria Schuld, Kevin Durrheim, Sindisiwe Mazibuko
These are word embeddings trained on South African news article data. They were produced with Word2Vec, a framework for training machine learning models for natural language processing.
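The page does not state how the embedding file is encoded. As a minimal sketch, assuming it uses the common word2vec plain-text layout (an optional "count dimension" header line, then one word followed by its vector components per line), vectors could be read and compared as below. The function names `load_text_embeddings` and `cosine` and the toy sample are hypothetical and are not taken from the dataset.

```python
import io
import math


def load_text_embeddings(handle):
    """Parse word2vec-style text lines into a {word: vector} dict.

    Assumed layout (not confirmed by the dataset page): an optional
    "<count> <dim>" header, then "<word> <v1> <v2> ..." on each line.
    """
    vectors = {}
    for line_no, line in enumerate(handle):
        parts = line.rstrip().split(" ")
        # Skip the optional header line made of two integers.
        if line_no == 0 and len(parts) == 2 and all(p.isdigit() for p in parts):
            continue
        # Skip blank or malformed lines.
        if len(parts) < 2:
            continue
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


# Toy two-dimensional stand-in for the real file (hypothetical values).
sample = io.StringIO("3 2\nnews 1.0 0.0\narticle 0.8 0.6\nrugby 0.0 1.0\n")
emb = load_text_embeddings(sample)
similarity = cosine(emb["news"], emb["article"])
```

To try this on the downloaded file, the `io.StringIO` sample would be replaced by `open("articles-big_100p-300d-50m-0.emb", encoding="utf-8")`. If the file turns out to be binary or in another layout, this parser will not apply.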

Funding

This work was funded by the Big Data for Science and Society (BDSS) project, UKZN’s Big Data and Informatics Research Flagship, South Africa’s National Research Foundation (NRF), and the South African Centre for Digital Language Resources (SADiLaR).
