Skip to content
Change the repository type filter

All

    Repositories list

    • g2p

      Public
      Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!
      Python
      34000Updated Nov 6, 2025Nov 6, 2025
    • 🤗 AutoTrain Advanced for integration with LiFE App Training Module
      Python
      633000Updated Jul 9, 2025Jul 9, 2025
    • stopwords

      Public
      This repository contains the stopword list in various Indian languages. This list is expected to be useful for researchers working in different fields including AI/NLP, Linguistics, Digital Humanities, etc
      0000Updated Feb 15, 2025Feb 15, 2025
    • life

      Public
      Linguistic Field Data Management and Analysis System [LiFE]
      Python
      1700Updated Oct 27, 2024Oct 27, 2024
    • Repository of data and scripts of UGC-UKIERI Project on "Automatic Detection of Verbal Threat in HIndi and English Aggressive Speech"
      Praat
      1000Updated Jun 9, 2024Jun 9, 2024
    • harmpot

      Public
      This repository contains the dataset, models and other details about the HarmPot (Measuring Harm Potential of Social Media Content in India) Project.
      0000Updated May 22, 2024May 22, 2024
    • A repository of the social media dataset in Hindi, annotated with politeness levels
      1000Updated Jan 10, 2024Jan 10, 2024
    • Repository of the data and tools for propaganda identification in HIndi
      Jupyter Notebook
      1000Updated Jan 10, 2024Jan 10, 2024
    • ComMA

      Public
      Dataset of 20,000 datapoints in Meitei, Bangla and Hindi, richly annotated with different levels of aggression and bias for the ComMA Project.
      1000Updated Jan 10, 2024Jan 10, 2024
    • SpeeD-IA

      Public
      Repository for different Speech Datasets and Models for Indo-Aryan languages.
      1000Updated Nov 27, 2023Nov 27, 2023
    • SpeeD-IL

      Public
      Central Repository for the Speech Datasets and Models in Indian Languages (SpeeD-IL) project. Each language family has a separate, dedicated repository linked to this central repository.
      0100Updated Aug 17, 2023Aug 17, 2023
    • crawlers

      Public
      Crawlers for automatically collecting data from different sources
      Python
      4000Updated Apr 4, 2023Apr 4, 2023
    • Punctuation Models for 12 Indian Languages
      Python
      25000Updated Dec 15, 2022Dec 15, 2022
    • Read, write, and manipulate Praat TextGrid files with Python
      Python
      30000Updated Nov 4, 2022Nov 4, 2022
    • mscrabble

      Public
      Repository for Multilingual Scrabble Generator and Games - especially aimed towards endangered languages
      JavaScript
      3100Updated Dec 16, 2021Dec 16, 2021