Inclusion of tldr-pages corpus from OPUS in data-index.json #33

@SethFalco

Over at tldr-pages, we developed a project to adapt the contents of tldr into a corpus for OPUS.

Would it be feasible to include it in argos-train/data-index.json?
I hope it would add more context around technical/command-line terminology to the dataset.

OPUS corpus: https://opus.nlpl.eu/tldr-pages.php
