-
Notifications
You must be signed in to change notification settings - Fork 2
Outputs of CellRangerCount seem to contain duplicated files in an archive #11
Copy link
Copy link
Open
Labels
bugSomething isn't workingSomething isn't working
Description
CellRangerCount seems to create a folder with a bunch of files.
Inside that folder there is a .tgz archive that contain a duplication of the files already in that folder. That TAR file contains most, but not all of the files in the output folder.
This should be cleaned up. Either:
- the tar file is completely redundant and should be removed before saving the output folder or
- the tar file is supposed to be the final product (with a few less files) and we shouldn't even attempt to save the output folder.
How to check as a CBRAIN developer:
a) cd to the root of a finished CellRangerCount task
b) create the list of files in the output dir:
find cellranger_count_res -type f | sort >/tmp/listfiles # adjust name of folderc) create the list of files in the .tgz archive
tar -tzf cellranger_count_res/cellranger_count_res.mri.tgz | sort >/tmp/archfilesd) Compare them with diff, or csdiff:
diff /tmp/listfiles /tmp/archfilesReactions are currently unavailable
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't working