Skip to content

Conversation

@ethanxia4
Copy link
Collaborator

Added wikiconv-corpus folder with updated wikiconv processing code that contains fixes to the original English processing code and flexibility for non English datasets. The code will be updated as the other languages get processed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant