Adding extended nlp_uk morphological tagging for PressMint-UA #51

@Dandelliony

Dear PressMint Team,

We are considering adding morphological annotations generated by the nlp_uk tagger to the annotated files. This tagset is our standard for Ukrainian (https://github.com/brown-uk/dict_uk/blob/master/doc/tags.txt).

If I’m not mistaken, you already offer a similar option for MULTEXT-East annotation, and section 6.1.1 of the documentation (https://clarin-eric.github.io/PressMint/#sec-ana-words) also allows this.

The only issue is that we use colons to separate tags. You can see an example of the annotation here: https://github.com/Dandelliony/PressMint/blob/data/Samples/PressMint-UA/Sources/Rada_1910_245_Aresht_semyi.tagged.xml
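
To make the colon point concrete, here is a minimal Python sketch of how such a colon-delimited tag breaks into its parts. The tag value and the `split_tag` helper are hypothetical and only illustrate the general shape; they are not taken from the sample file or from any PressMint tooling.

```python
def split_tag(tag: str) -> tuple[str, list[str]]:
    """Split a colon-delimited tag into (part_of_speech, features).

    Assumption: the first component is the part of speech and the
    remaining components are individual features.
    """
    pos, *features = tag.split(":")
    return pos, features


# Illustrative tag value (hypothetical, not copied from the sample file).
pos, features = split_tag("noun:inanim:m:v_naz")
print(pos, features)
```

Any scheme that also uses the colon as a delimiter (for example, a prefixed reference) would need to treat the whole tag as a single opaque value or escape it.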

Please let us know if there are any other points to consider; we are currently reviewing the documentation on this topic.

Best regards,
Arsenij
