Add a url for constructing a hub.txt file for the genome-browser #421
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Based on:
https://genome.ucsc.edu/goldenPath/help/hgTracksHelp.html#UseOneFile
This page essentially defines a hub.txt file for any file in the c2m2
but it will only work for files that:
access_urlpersistent_id with the actual dataWe use the genome: hg19 but this may not be accurate,
it's unclear how we'd determine the correct one at this stage.
A hub can thus be assembled by pointing to the url at:
https://cfde.cloud/data/c2m2/file/{id_namespace}/{local_id}/genome-browser/hub.txt
Some caveats of this, if they weren't already obvious:
access_url
s yet, and the only files which could ever work do not define the right urls in thepersistent_id
field.access_url
, we'll possibly have DRS URIs which it doesn't seem the genome browser will be able to handleBecause of these caveats we need to think more about how we can get the other information to support this use case. We'll need to:
Thoughts: As this is rather specific to these types of files, it would belong in an independent table anyway that simply lists all genome-browser compatible files, it feels to me like it is just a bit out of scope of the current c2m2 and would be more useful as an independent effort to gather "tracks" from DCCs, perhaps as another element of the data matrix.