    Repositories list

    • common-voice

      Public
      Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
      TypeScript
      8753.4k16341 Updated Jan 29, 2026
    • Python
      254164 Updated Dec 18, 2025
    • Metadata and versioning details for the Common Voice dataset
      JavaScript
      18164161 Updated Dec 18, 2025
    • Command line tool to create corpora for Common Voice
      Python
      2978108 Updated Dec 4, 2025
    • Common Voice documentation
      2000 Updated Nov 1, 2025
    • Jupyter Notebook
      0001 Updated Aug 11, 2025
    • Tooling for producing French dataset for Common Voice
      Python
      24101117 Updated Jan 20, 2025
    • our-voices-model-competition

      Public archive
      Our Voices Competition
      12607 Updated Jun 10, 2024
    • community-playbook

      Public
      Mozilla Voice Community Playbook
      184840 Updated May 21, 2024
    • Scraping Wikipedia for fair use sentences
      Rust
      515449 Updated Jan 25, 2024
    • Calculate the individual and total duration of a directory full of .mp3 files
      Rust
      0000 Updated Nov 1, 2023
    • sentence-collector

      Public archive
      Tool to collect and review sentences for Common Voice
      JavaScript
      628200 Updated May 10, 2023
    • Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
      TypeScript
      875000 Updated May 5, 2023
    • Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
      TypeScript
      875100 Updated Apr 17, 2023
    • Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language
      JavaScript
      71183 Updated Apr 13, 2023
    • Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
      TypeScript
      875000 Updated Mar 22, 2023
    • This is our new repository to make other open speech datasets from the community easier to find. If you'd like to add yours, please get in touch!
      1100 Updated Mar 9, 2023
    • Common Voice Helm Charts
      Mustache
      1201 Updated Mar 21, 2022
    • Automation for generating the Common Voice corpora
      Python
      2120 Updated Apr 16, 2020
    • Various analyses and files from Wikipedia text analysis
      1300 Updated Jul 17, 2019
    • A living document outlining a methodological approach for building read speech sentence corpora.
      1400 Updated Mar 20, 2019
    • mandarin

      Public
      All efforts around the Mandarin dataset
      0010 Updated Feb 21, 2019
    • Voicebot for contributing voice snippets to voice.mozilla.org
      Python
      1540 Updated Feb 7, 2019
    • A Redux binding for React Router v4
      JavaScript
      586000 Updated Aug 29, 2018
    • Manipulate sentences.
      JavaScript
      3200 Updated Aug 9, 2018
    • planning

      Public archive
      This is where we organize the work around the Common Voice project
      1100 Updated Jul 10, 2018
    • This is a repo that will contain all the reviewed sentences collected by the global sprint.
      JavaScript
      2111 Updated Jun 28, 2018