Skip to content
Change the repository type filter

All

    Repositories list

    • Jupyter Notebook
      0100Updated Dec 23, 2025Dec 23, 2025
    • The CLASSLA-web corpora website.
      HTML
      0000Updated Dec 22, 2025Dec 22, 2025
    • parlacap

      Public
      HTML
      0000Updated Dec 19, 2025Dec 19, 2025
    • Conversion to and from the CLARIN.SI TEI format
      XSLT
      0000Updated Dec 17, 2025Dec 17, 2025
    • Corplus

      Public
      Corplus: A concordancer for corpora with language corrections
      TypeScript
      0000Updated Dec 15, 2025Dec 15, 2025
    • Automatic treebank comparison tool supporting various levels of linguistic analysis
      Python
      0100Updated Dec 12, 2025Dec 12, 2025
    • Python
      0000Updated Dec 10, 2025Dec 10, 2025
    • Repository for SloBench evaluation docker images
      Perl
      5100Updated Oct 29, 2025Oct 29, 2025
    • Code for bootstrapping ASR datasets from parliamentary recordings and transcripts
      1810Updated Oct 9, 2025Oct 9, 2025
    • An ever-expanding overview of the knowledge on large language models (LLMs), speech technologies, and other NLP technologies for Slovenian language.
      1800Updated Sep 24, 2025Sep 24, 2025
    • STARK

      Public
      Python
      4500Updated Aug 24, 2025Aug 24, 2025
    • LINDAT/CLARIN digital repository based on DSpace
      Java
      1.4k0120Updated Jun 11, 2025Jun 11, 2025
    • Python
      0000Updated Jun 4, 2025Jun 4, 2025
    • drevesnik

      Public
      Web portal for searching and displaying syntacically annotated corpora
      JavaScript
      0100Updated May 22, 2025May 22, 2025
    • 0000Updated May 15, 2025May 15, 2025
    • 0000Updated May 13, 2025May 13, 2025
    • classla

      Public
      CLASSLA Fork of the Official Stanford NLP Python Library for Many Human Languages
      Python
      9324611Updated May 6, 2025May 6, 2025
    • siius

      Public
      Digital library and corpus of older Slovenian legal texts SI-IUS
      XSLT
      0000Updated Apr 25, 2025Apr 25, 2025
    • benchich

      Public
      BENCHić - the benchmark for Bosnian, Croatian, Montenegrin, Serbian (and friends)
      Python
      0210Updated Apr 22, 2025Apr 22, 2025
    • Code for ParlaSent research note
      Jupyter Notebook
      1000Updated Apr 22, 2025Apr 22, 2025
    • Recommended TEI schema for CLARIN.SI resources, cf. also https://clarinsi.github.io/TEI-schema/
      XSLT
      0200Updated Mar 16, 2025Mar 16, 2025
    • Training scripts for the CLASSLA pipeline
      Python
      0000Updated Feb 19, 2025Feb 19, 2025
    • 0000Updated Feb 13, 2025Feb 13, 2025
    • ROG

      Public
      Elixir
      0000Updated Dec 20, 2024Dec 20, 2024
    • mte-msd

      Public
      MULTEXT-East morphosyntactic specifications
      HTML
      11000Updated Nov 24, 2024Nov 24, 2024
    • Python
      1000Updated Nov 18, 2024Nov 18, 2024
    • Repo for tracking resources for the Mezzanine project
      0000Updated Nov 12, 2024Nov 12, 2024
    • Editor for normalising learner texts (error annotation and tagging.)
      TypeScript
      4000Updated Sep 4, 2024Sep 4, 2024
    • Tool for extracting linguistic features with highest (known) variation among the HBS standards
      Python
      0000Updated Jul 17, 2024Jul 17, 2024
    • A two-mode (standard, nonstandard) tokeniser for South Slavic languages
      Python
      7521Updated Jul 9, 2024Jul 9, 2024