-
Notifications
You must be signed in to change notification settings - Fork 20
Add genetic map support for all tools #216
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: dev
Are you sure you want to change the base?
Conversation
atrigila
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since this is a pretty extensive change, I would benefit from reviewing it in incremental stages. I suggest you make PR's to modules. There, you can submit the local subworkflows and add the tests that require using maps. Once we get those approved, it will be easier to then focus on the broader picture here with tests aiming at the whole pipeline tests.
| conda "${moduleDir}/environment.yml" | ||
| container "${ workflow.containerEngine == 'singularity' && !task.ext.singularity_pull_docker_container ? | ||
| 'https://depot.galaxyproject.org/singularity/gawk:5.3.0' : | ||
| 'biocontainers/gawk:5.3.0' }" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If we are using GAWK, why don't we use the nf-core module?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The issue to use the nf-core module is that it would result in a more complex looking module (script in the config), splitting output channel based on regex and therefore a more complicated modules to maintain.
Writing it this may does add a new local module, but make it easier to understand how it works.
docs/usage.md
Outdated
| `--map_sep "\t" --map_header true --map_col_names "chr,id,cm,pos"` | ||
|
|
||
| ```csv title="chr21.map" | ||
| chr id centimorgan position |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Your header here is centimorgan position while in the description you say that it should be pos and cm
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The header is in fact not parsed but can be present, so the columns names in the map file are not useful.
| - `--map_sep`: Field separator used in the map file (e.g. "\t", " ", ",") | ||
| - `--map_header`: Whether the file contains a header row (`true` or `false`) | ||
| - `--map_col_names`: Ordered list of column names in the file |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think these could be easily inferred in the module so that we don't add new params to users (which means they might be more prone to errors).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Shouldn't we have a test for all the other imputation tools that require a map? A test with and without a map.
PR checklist
This PR add genetic map support for phasing and imputation tool as well as auto convertion to all format.
nf-core pipelines lint).nextflow run . -profile test,docker --outdir <OUTDIR>).nextflow run . -profile debug,test,docker --outdir <OUTDIR>).docs/usage.mdis updated.docs/output.mdis updated.CHANGELOG.mdis updated.README.mdis updated (including new tool citations and authors/contributors).