Allow languages other than natural languages #3862

@ZhiyuanChen

Description

Is your feature request related to a problem? Please describe.

Recently, the Hugging Face Hub began enforcing a new policy: the language specified in the README must be a valid ISO 639-1, 639-2, or 639-3 code.

However, several models (such as ProGen2) are designed for proteins, and amino acid sequences are not covered by the ISO codes.

Describe the solution you'd like

Add `dna`, `rna`, and `protein` as special values, like `code`.
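To make the request concrete, here is a minimal sketch of the check I have in mind. It is not the Hub's actual validation code: `ISO_639_SUBSET`, `SPECIAL_VALUES`, and `is_valid_language` are hypothetical names, and the ISO set is a tiny illustrative subset rather than the full standard.

```python
# Hypothetical sketch; not the Hub's real validation logic.
# A tiny illustrative subset of ISO 639 codes; the real lists are much longer.
ISO_639_SUBSET = {"en", "fr", "zh", "eng", "fra", "zho"}

# "code" is the special value mentioned in this issue;
# "dna", "rna", and "protein" are the values proposed here.
SPECIAL_VALUES = {"code", "dna", "rna", "protein"}


def is_valid_language(tag: str) -> bool:
    """Return True if the tag is an ISO 639 code or an allowed special value."""
    normalized = tag.strip().lower()
    return normalized in ISO_639_SUBSET or normalized in SPECIAL_VALUES


print(is_valid_language("en"))       # True
print(is_valid_language("protein"))  # True
print(is_valid_language("smiles"))   # False
```

With an allow-list like this, supporting another non-natural language later would only mean adding one entry to the set of special values.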

Describe alternatives you've considered

Emit a warning instead of raising an error for an unmatched language.
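A rough sketch of this alternative, under the same caveats: `KNOWN_LANGUAGES` and `check_language` are hypothetical names, the language set is illustrative only, and the `strict` flag is my own assumption about how the two behaviours could coexist.

```python
import warnings

# Illustrative subset only (assumption); the real list of accepted codes is much longer.
KNOWN_LANGUAGES = {"en", "fr", "zh", "code"}


def check_language(tag: str, strict: bool = False) -> bool:
    """Return True for a known tag; warn (or raise when strict) otherwise."""
    if tag.strip().lower() in KNOWN_LANGUAGES:
        return True
    message = f"Unrecognized language tag: {tag!r}"
    if strict:
        # Strict mode: reject the unmatched tag with an error.
        raise ValueError(message)
    # Proposed alternative: let the tag through, but tell the user about it.
    warnings.warn(message, UserWarning, stacklevel=2)
    return False
```

Calling `check_language("protein")` would then emit a `UserWarning` and return `False` instead of blocking the upload, while `check_language("protein", strict=True)` would still raise.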

Additional context

There are also models that take chemical molecules as inputs.
