figshare

Language models for the article "Slovak morphological tokenizer using the Byte-Pair Encoding algorithm"

Version 2 2024-08-15, 18:11
Version 1 2024-06-30, 15:59
software
posted on 2024-08-15, 18:11, authored by Dávid Držík

Two small language models trained with the pureBPE and SKMT tokenizers for 10 epochs.
