@benjelloun cc. @wumpus
Sharing a draft zip file as a follow-up to #961
CCF_crawl_croissants_and_provenance_mockup.zip
Zip file includes:
- 117 Croissant drafts, one for each of our crawls.
- 1 mockup example of a provenance citation to our crawls.
  - This kind of hierarchy doesn't exist in our crawls, so we won't actually have this file in CCF; it is a mockup for datasets that refer to CCF.
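We would like feedback especially on:
- How we are using provenance
- How we use "distribution" with FileObjects and FileSets
  - e.g. the warc.paths.gz FileObject

Please let us know if anything looks awry!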
Changes since #961:
- New FileObject added: `{crawl_id}.domains-top-1000` (crawls > 2012)
- Switched to using MAJOR.MINOR.PATCH for the build version as well: `1.0.0+1.0.0`
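
For anyone consuming these version strings, here is a minimal sketch of how the `1.0.0+1.0.0` form could be split into its two MAJOR.MINOR.PATCH parts. It assumes the part before `+` is the dataset version and the part after is the build version; `VERSION_RE` and `parse_version` are hypothetical names and are not part of the zip.

```python
import re

# Hypothetical pattern: MAJOR.MINOR.PATCH, a "+", then MAJOR.MINOR.PATCH for the build.
VERSION_RE = re.compile(r"(\d+)\.(\d+)\.(\d+)\+(\d+)\.(\d+)\.(\d+)")


def parse_version(text):
    """Split 'X.Y.Z+A.B.C' into ((X, Y, Z), (A, B, C)) as integer tuples."""
    match = VERSION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"not a MAJOR.MINOR.PATCH+MAJOR.MINOR.PATCH string: {text!r}")
    numbers = tuple(int(group) for group in match.groups())
    return numbers[:3], numbers[3:]


version, build = parse_version("1.0.0+1.0.0")
```

Integer tuples compare element by element, so two parsed versions can be ordered directly with `<` and `>`.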