Skip to content

Conversation

@tejasvaidhyadev
Copy link
Member

Adding GMB Dataset.
The dataset an extract from GMB corpus which is tagged, annotated and built specifically to train the classifier to predict named entities such as name, location, etc.

Copy link
Member

@oxinabox oxinabox left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cool, good idea.
A few comments.
Also needs docs and tests

tejasvaidhyadev and others added 3 commits March 11, 2020 20:52
Co-Authored-By: Lyndon White <[email protected]>
Co-Authored-By: Lyndon White <[email protected]>
Co-Authored-By: Lyndon White <[email protected]>
@tejasvaidhyadev
Copy link
Member Author

Thankyou I will implement suggested changes(including Docs and tests ) soon

Co-Authored-By: Lyndon White <[email protected]>
@tejasvaidhyadev
Copy link
Member Author

Hi @oxinabox added some testsets by taking examples from other datasets.I don't know much about tests and i am still learning.
let me know what else tests can be added.

@tejasvaidhyadev
Copy link
Member Author

tejasvaidhyadev commented Mar 14, 2020

Hi @oxinabox
For now I added only POS tagged of GMB
As my project only need POS tags and i will also implement NER tags soon
Thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants