Record logic for some processors may be wrong #68

Open
@hunterhector

Describe the bug
For example, in the `record` function of the spaCy processor:
https://github.com/asyml/forte-wrappers/blob/main/src/spacy/forte/spacy/spacy_processors.py#L280

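The `record_meta` entry for the Sentence type is first set to an empty set. However, this assumes that no record exists yet for Sentence, which may not be true if earlier processors in the pipeline have already added Sentences; in that case the existing record would be overwritten.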
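To make the concern concrete, here is a minimal sketch using plain dictionaries rather than the actual Forte classes. The function names `record_overwrite` and `record_merge` and the `speaker` attribute are illustrative assumptions, not code taken from the wrapper; the sketch only contrasts resetting the entry with preserving what an earlier processor recorded.

```python
from typing import Dict, Set

SENTENCE = "ft.onto.base_ontology.Sentence"


def record_overwrite(record_meta: Dict[str, Set[str]]) -> None:
    # Hypothetical stand-in for the behavior described in the issue:
    # the Sentence entry is reset unconditionally.
    record_meta[SENTENCE] = set()


def record_merge(record_meta: Dict[str, Set[str]]) -> None:
    # Hypothetical alternative: create the entry only if it is missing,
    # so attributes recorded by earlier processors are kept.
    record_meta.setdefault(SENTENCE, set())


# Assume an earlier processor already recorded a Sentence attribute.
overwritten: Dict[str, Set[str]] = {SENTENCE: {"speaker"}}
merged: Dict[str, Set[str]] = {SENTENCE: {"speaker"}}

record_overwrite(overwritten)  # the earlier attribute is lost
record_merge(merged)           # the earlier attribute survives
```

Merging avoids the data loss, but it still cannot tell which processor contributed which attribute.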

The open question is what we should do when two processors both add the same type, such as Sentence. We should probably maintain a separate record for each processor, something like:

{
    "spacy": {"ft.onto.base_ontology.Token": {"ner", "pos"}},
    "stanza": {"ft.onto.base_ontology.Token": {"lemma", "pos"}}
}
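One possible shape for such per-processor records is sketched below with plain Python containers. The helpers `add_record` and `attributes_available` are hypothetical names introduced for illustration and are not part of Forte's API; the processor keys and attribute names are the ones from the example above.

```python
from collections import defaultdict
from typing import Dict, Set

TOKEN = "ft.onto.base_ontology.Token"

# processor name -> entry type -> attributes written by that processor
Records = Dict[str, Dict[str, Set[str]]]


def add_record(records: Records, processor: str,
               entry_type: str, attributes: Set[str]) -> None:
    # Each processor writes only into its own namespace, so it cannot
    # clobber what another processor recorded for the same type.
    records[processor].setdefault(entry_type, set()).update(attributes)


def attributes_available(records: Records, entry_type: str) -> Set[str]:
    # Union across processors: what a downstream consumer can rely on.
    available: Set[str] = set()
    for per_type in records.values():
        available |= per_type.get(entry_type, set())
    return available


records: Records = defaultdict(dict)
add_record(records, "spacy", TOKEN, {"ner", "pos"})
add_record(records, "stanza", TOKEN, {"lemma", "pos"})
```

Keeping the records separate preserves provenance, while a union such as `attributes_available` could still serve downstream checks that only care whether an attribute exists at all.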
