1.5.0 - UNSC Punic
Added and Fixed
- Template update to 3.3.2.
- Addition of the
--split_telomereboolean flag, this is false by default.- When
truethe pipeline will split the telomere file into a 5 and 3 prime file.
- When
- Update
ACCESSORY_FILESsubworkflow:- Remove
GET_LARGEST_SCAFFOLDas we no longer need it, this was needed for TABIX so that the correct index file was used. This was used by theTELO_FINDERandGAP_FINDERsubworkflows.
- Remove
- Update
TELO_FINDERsubworkflow:- Remove
GAWK_MAP_TELOas it is no longer needed. - Remove
GAWK_CLEAN_TELOMEREas it is no longer needed. The reason for its inclusion has been fixed. - Update
EXTRACT_TELOtoEXTRACT_TELOMEREwhich also removed the use of thecat {file} | awkpattern, replacing it with justawk. This was supposed to happen in1.4.0, but was forgotten with the files lying dormant in the repo. - Refactor of the
TELO_FINDERsubworkflow, introducing theTELO_EXTRACTIONsubworkflow which is run per telo file. With the introduction ofsplit_telomerethis can be 3 files.
- Remove
- Update
LONGREAD_COVERAGEsubworkflow:- Remove
GRAPH_OVERALL_COVERAGEas it is not in use.
- Remove
- Better formatting in some files.
- Moved
GAWK_UPPER_SEQUENCEfrom theTELO_FINDERsubworkflow to the first step of the maincurationpretextworkflow, this simply makes more sense. - Removed no longer needed scripts from bin.
- Added the module
GAWK_SPLIT_DIRECTIONSmodule, a local copy of the nf-coreGAWKmodule. - Added the
gawk_split_directions.awkscript for split telomere. - Addition of GUNZIP for the input reference genome.
- Update tests.
- Added an "AUTO" value to the
--alignerarg. If a genome is >5Gb it will use minimap2 else bwamem2. - Parity update for the base.config to match TreeVal.
- Minor Doc updates.
- Comment out the CONDA workflow requirement, pipeline does not support conda.
Full Changelog: 1.4.2...1.5.0