Commit 1a1bd13

Fix the license term for the dataset (#25)
* Update README.md
* Add README to bench/data
* fix
* fix
1 parent 051933c commit 1a1bd13

2 files changed: +10 -0 lines changed

README.md

+2
@@ -208,6 +208,8 @@ Licensed under either of
 
 at your option.
 
+For softwares under `bench/data`, follow the license terms of each software.
+
 ## Contribution
 
 Unless you explicitly state otherwise, any contribution intentionally submitted

bench/data/README.md

+8
@@ -0,0 +1,8 @@
+# Datasets for benchmarking
+
+These datasets are copied from third party repositories.
+
+* `unidic`: [National Institute for Japanese Language and Linguistics](https://ccd.ninjal.ac.jp/unidic/)
+* `sherlock.txt`: [Project Gutenberg](https://www.gutenberg.org/ebooks/1661)
+* `wagahaiwa_nekodearu.txt`: [Aozora Bunko](https://www.aozora.gr.jp/cards/000148/card789.html)
+* `words_100000`: [fst crate](https://github.com/BurntSushi/fst/blob/master/data/words-100000)
