Skip to content
Change the repository type filter

All

    Repositories list

    • Metadata sources for all service providers in the CLARIN Service Provider Federation
      Shell
      63000Updated Dec 11, 2025Dec 11, 2025
    • CLARIN-PL digital library based on DSpace
      Java
      1.4k001Updated Oct 31, 2025Oct 31, 2025
    • Inforex

      Public
      Inforex is a web system for text corpora construction.
      JavaScript
      91213Updated Jun 26, 2025Jun 26, 2025
    • standards

      Public
      CLARIN-PL work space for the Standards and Interoperability Committee
      XQuery
      26000Updated Jun 11, 2025Jun 11, 2025
    • 0000Updated Apr 18, 2025Apr 18, 2025
    • Java
      1007Updated Apr 10, 2025Apr 10, 2025
    • Annotation Pro plugin utilizing ClarinPL speech tools and models
      C++
      1300Updated Nov 27, 2024Nov 27, 2024
    • PUGG

      Public
      Python
      0000Updated Aug 12, 2024Aug 12, 2024
    • RetNet

      Public
      Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent, and chunkwise forward.
      Jupyter Notebook
      26000Updated Apr 2, 2024Apr 2, 2024
    • Jupyter Notebook
      4913Updated Mar 28, 2024Mar 28, 2024
    • An advanced, extensible web front-end for the Manatee-open corpus search engine
      TypeScript
      24000Updated Dec 14, 2023Dec 14, 2023
    • Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language
      Python
      337435Updated Dec 3, 2023Dec 3, 2023
    • klajster

      Public
      Python
      0001Updated Nov 30, 2023Nov 30, 2023
    • argilla

      Public
      ✨Argilla: the open-source data curation platform for LLMs
      Python
      474000Updated Nov 26, 2023Nov 26, 2023
    • LEPISZCZE

      Public
      This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish
      Python
      21411Updated Nov 24, 2023Nov 24, 2023
    • doccano

      Public
      Open source annotation tool for machine learning practitioners.
      Python
      1.8k104Updated Nov 9, 2023Nov 9, 2023
    • 0000Updated Oct 14, 2023Oct 14, 2023
    • Source code for paper "From Big to Small Without Losing It All: Text Augmentation with ChatGPT for Efficient Sentiment Analysis" published at the 13th ICDM Work…
      Jupyter Notebook
      0300Updated Oct 12, 2023Oct 12, 2023
    • Source code for paper "Towards Model-Based Data Acquisition for Subjective Multi-Task NLP Problems" published at the 13th ICDM Workshop on Sentiment Elicitation…
      Jupyter Notebook
      0000Updated Oct 3, 2023Oct 3, 2023
    • Source code for paper "Capturing Human Perspectives in NLP: Questionnaires, Annotations, and Biases" published at the 2nd Workshop on Perspectivist Approaches t…
      Jupyter Notebook
      1000Updated Sep 17, 2023Sep 17, 2023
    • 0000Updated Jul 31, 2023Jul 31, 2023
    • Liner2

      Public
      Generic framework for information extraction tasks, including recognition of named entities, temporal expressions, spatial expressions and events.
      Java
      61350Updated Jun 5, 2023Jun 5, 2023
    • A simple client for doccano API.
      Python
      68000Updated Apr 13, 2023Apr 13, 2023
    • Temporal storage for LEPISZCZE datasets descriptions
      0000Updated Mar 29, 2023Mar 29, 2023
    • A tool for recognition of spatial expressions containing trajector, spatial indicator and landmark.
      Python
      0001Updated Mar 24, 2023Mar 24, 2023
    • Tool for named entity recognition for Polish based on deep learning.
      Python
      63111Updated Mar 24, 2023Mar 24, 2023
    • Code, datasets and results of the ChatGPT evaluation presented in paper "ChatGPT: Jack of all trades, master of none"
      Jupyter Notebook
      42900Updated Mar 7, 2023Mar 7, 2023
    • Single page application, angular (http://polonjid-dictionary.clarin-pl.eu)
      TypeScript
      0007Updated Feb 4, 2023Feb 4, 2023
    • Polem

      Public
      Tool for lemmatization of multi-word phrases and named entities for Polish.
      HTML
      4910Updated Dec 6, 2022Dec 6, 2022
    • Wordnet Visual Editor
      Java
      2104Updated Nov 24, 2022Nov 24, 2022