Commit a853c2a

docs: Added link to Common Crawl's terms of use
1 parent 05ce7fd commit a853c2a

1 file changed, +4 -2 lines changed

docs/versions/mOSCAR.md

Lines changed: 4 additions & 2 deletions
@@ -27,8 +27,8 @@ Paper link: [https://arxiv.org/abs/2406.08707](https://arxiv.org/abs/2406.08707)
 
 ## Language table
 
-| Lang. name | Code | Family | Script | #documents | #images | # tokens |
-| ---------------------- | -------- | ------------- | ---------- | ----------- | ----------- | -------------- |
+| Lang. name | Code | Family | Script | #documents | #images | # tokens |
+| ---------------------- | -------- | -------------- | ---------- | ---------- | ----------- | -------------- |
 | Acehnese | ace_Latn | Austronesian | Latin | 7,803 | 32,461 | 2,889,134 |
 | Mesopotamian Arabic | acm_Arab | Afro-Asiatic | Arabic | 2,274 | 10,620 | 1,047,748 |
 | Tunisian Arabic | aeb_Arab | Afro-Asiatic | Arabic | 7,640 | 41,570 | 2,715,187 |
@@ -202,6 +202,8 @@ These data are released under this licensing scheme:
 - To the extent possible under law, Inria has waived all copyright and related or neighboring rights to OSCAR.
 - This work is published from: France.
 
+Please also refer to Common Crawl's [Terms of Use](https://commoncrawl.org/terms-of-use)
+
 ## Citation
 ```
 @article{futeral2024moscar,
