Use tabix when splitting imputation server vcfs#60

Open
robkar wants to merge 1 commit into Ripkelab:master from robkar:imp_deploy_use_index

@robkar robkar commented Feb 20, 2019

As discussed by email with Stephan: when using the new --deploy option for pre- and postprocessing imputation server files, I found the conversion from chromosomal vcfs to dosage chunks too slow in some cases.

I updated the scripts to use tabix for deployed imputation files with indices, which improved performance substantially (now my problematic chunks finish before the cluster kills them).
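
To illustrate why the index helps, here is a minimal Python sketch (not the code in this PR) of a region query with the `tabix` command-line tool. The function names, file name, and coordinates are hypothetical; the sketch assumes a bgzipped VCF with an index file next to it and `tabix` on the PATH.

```python
import subprocess


def tabix_command(vcf_gz, chrom, start, end):
    """Build a tabix call that extracts one region from an indexed VCF.

    Hypothetical helper for illustration; regions are written as
    chrom:start-end, and -h asks tabix to print the header lines too.
    """
    region = f"{chrom}:{start}-{end}"
    return ["tabix", "-h", vcf_gz, region]


def read_chunk(vcf_gz, chrom, start, end):
    """Return the header and records of one chunk as a list of lines.

    Only the blocks covering the region are read, instead of
    scanning the whole chromosomal file for every chunk.
    """
    cmd = tabix_command(vcf_gz, chrom, start, end)
    proc = subprocess.run(cmd, check=True, capture_output=True, text=True)
    return proc.stdout.splitlines()
```

For example, `read_chunk("chr1.dose.vcf.gz", "1", 1000000, 4000000)` would return only the header plus the records in that 3 Mb window.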

The logic I wrote stops with an error if --deploy is used and there are no index files (I think both Sanger and Michigan provide them?). For other uses of vcf2dos (i.e. in-house imputation), the script should proceed by reading the full file if there is no index.
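
The decision described above could be sketched in Python roughly as follows. This is an illustration under assumptions, not the actual script logic: the function names are hypothetical, the index is assumed to sit next to the VCF with a `.tbi` or `.csi` suffix, and an index is assumed to be used whenever one is present.

```python
import os


def has_index(vcf_gz):
    """True if a tabix (.tbi) or CSI (.csi) index sits next to the file."""
    return any(os.path.exists(vcf_gz + ext) for ext in (".tbi", ".csi"))


def choose_reader(vcf_gz, deploy):
    """Pick how to read a VCF: 'tabix' (indexed) or 'full' (whole file).

    Hypothetical helper: with deploy=True a missing index is an error,
    otherwise the caller falls back to reading the full file.
    """
    if has_index(vcf_gz):
        return "tabix"
    if deploy:
        raise FileNotFoundError(f"no index found for {vcf_gz} (required with --deploy)")
    return "full"
```

Failing early in the --deploy case avoids silently falling back to the slow full-file read on a cluster with time limits.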
