GitHub - forTEXT/GermAnProse: A dataset of annotated German prose

GermAnProse

The JSON files each contain the annotations for a single text. Annotator names were replaced to ensure anonymity is retained.

We plan to align the naming within the JSON structure more closely with the naming in our paper for the final release. Here, we provide a broad description of the structure; additional documentation will be released alongside the ChiA guidelines upon acceptance.

One JSON object per document
ChiA Annotations are:
- mentions (for character mentions)
- participations (for the agency data)
- direct_speech
All ChiA annotations are top-level keys in the JSON and contain an object with annotators and their corresponding annotations.
- Each annotation holds a "spans" object referring to the spans of text that are annotated.
The narrativity annotations are called "events", we do not have multiple annotators here
The verb class annotations are available under "verbclasses", we do not have multiple annotators here
"keyness" is our measure of plot keyness
"tokens" and "sentences" are automatically created data (using spaCy)
"scenes" contains the scene annotations
"speech_info" holds the timing information in audiobooks for each reader

For an example of the data structure, see below:

{
    "title": "Document Name",
    "full_text": full_text,
    "participations": {
        "annotator_a": {
            "mentions": {
                "character_a": [{"kind": "name", "spans": [[12, 15]]}]
            }
        },
        "annotator_b": ...
    },
    ...
}

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
Das Erdbeben in Chili.json		Das Erdbeben in Chili.json
Der blonde Eckbert.json		Der blonde Eckbert.json
Die Verwandlung.json		Die Verwandlung.json
Krambambuli.json		Krambambuli.json
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GermAnProse

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 1

Folders and files

Latest commit

History

Repository files navigation

GermAnProse

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 1

Packages