classla-training

This repository contains the training scripts for the CLASSLA pipeline and the evaluation results for the new models.

The latest training process for standard Slovenian was carried out in January 2023 on the Slovenian SUK training corpus. The following train-dev-test splits for the SUK training corpus were used.

A detailed description of the training and evaluation processes can be found in sl/standard/README.detailed.md, along with a table of evaluation results. The conllu/ directory contains the gold data for SUK, eval_scores/ contains detailed evaluation results, and out/ contains all of the predictions.

Latest training of Slovenian non-standard models was performed in March 2023 on the Janes-Tag 3.0 corpus. The training and evaluation processes for sl non-standard models is detailed in sl/non-standard/README.detailed.nonst.md.

The first training of spoken models for Slovenian and a new model for Slovenian UD dependency parsing (trained on SUK v1.1) were performed in January 2025. The training processes are detailed in sl/spoken/ and sl/standard_SUK_1.1/.

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
bg/standard		bg/standard
hr		hr
mk/standard		mk/standard
sl		sl
sr		sr
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

classla-training

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

clarinsi/classla-training

Folders and files

Latest commit

History

Repository files navigation

classla-training

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages