|
3 | 3 | The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/) |
4 | 4 | and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html). |
5 | 5 |
|
| 6 | +## [[1.5.0](https://github.com/sanger-tol/curationpretext/releases/tag/1.5.0)] - UNSC Punic - [2025-08-04] |
| 7 | + |
| 8 | +### Added and Fixed |
| 9 | + |
| 10 | +- Template update to 3.3.3. <TODO in next PR>. |
| 11 | +- Addition of the `--split_telomere` boolean flag, this is false by default. |
| 12 | + - When `true` the pipeline will split the telomere file into a 5 and 3 prime file. |
| 13 | +- Update `ACCESSORY_FILES` subworkflow: |
| 14 | + - Remove `GET_LARGEST_SCAFFOLD` as we no longer need it, this was needed for TABIX so that the correct index file was used. This was used by the `TELO_FINDER` and `GAP_FINDER` subworkflows. |
| 15 | +- Update `TELO_FINDER` subworkflow: |
| 16 | + - Remove `GAWK_MAP_TELO` as it is no longer needed. |
| 17 | + - Remove `GAWK_CLEAN_TELOMERE` as it is no longer needed. The reason for its inclusion has been fixed. |
| 18 | + - Update `EXTRACT_TELO` to `EXTRACT_TELOMERE` which also removed the use of the `cat {file} | awk` pattern, replacing it with just `awk`. This was supposed to happen in `1.4.0`, but was forgotten with the files lying dormant in the repo. |
| 19 | + - Refactor of the `TELO_FINDER` subworkflow, introducing the `TELO_EXTRACTION` subworkflow which is run per telo file. With the introduction of `split_telomere` this can be 3 files. |
| 20 | +- Update `LONGREAD_COVERAGE` subworkflow: |
| 21 | + - Remove `GRAPH_OVERALL_COVERAGE` as it is not in use. |
| 22 | +- Better formatting in some files. |
| 23 | +- Moved `GAWK_UPPER_SEQUENCE` from the `TELO_FINDER` subworkflow to the first step of the main `curationpretext` workflow, this simply makes more sense. |
| 24 | +- Removed no longer needed scripts from bin. |
| 25 | +- Added the module `GAWK_SPLIT_DIRECTIONS` module, a local copy of the nf-core `GAWK` module. |
| 26 | +- Added the `gawk_split_directions.awk` script for split telomere. |
| 27 | +- Addition of GUNZIP for the input reference genome. |
| 28 | +- Update tests. |
| 29 | + |
| 30 | +### Paramters |
| 31 | + |
| 32 | +| Old Version | New Versions | |
| 33 | +| ----------- | ---------------- | |
| 34 | +| NA | --split_telomere | |
| 35 | + |
| 36 | +### Software Dependencies |
| 37 | + |
| 38 | +Note, since the pipeline is using Nextflow DSL2, each process will be run with its own Biocontainer. This means that on occasion it is entirely possible for the pipeline to be using different versions of the same tool. However, the overall software dependency changes compared to the last release have been listed below for reference. |
| 39 | + |
| 40 | +| Module | Old Version | New Versions | |
| 41 | +| ------------------------ | ------------- | ------------- | |
| 42 | +| `GRAPH_OVERALL_COVERAGE` | perl=5.26.2 | REMOVED | |
| 43 | +| `EXTRACT_TELO` | coreutils=9.1 | REMOVED | |
| 44 | +| `EXTRACT_TELOMERE` | NA | coreutils=9.1 | |
| 45 | +| `GAWK_CLEAN_TELOMERE` | 5.3.0 | REMOVED | |
| 46 | +| `GAWK_MAP_TELO` | 5.3.0 | REMOVED | |
| 47 | +| `GET_LARGEST_SCAFF` | coreutils=9.1 | REMOVED | |
| 48 | +| `GUNZIP` | NA | 1.13 | |
| 49 | +| `GAWK_SPLIT_DIRECTIONS` | NA | 5.3.0 | |
| 50 | + |
6 | 51 | ## [[1.4.2](https://github.com/sanger-tol/curationpretext/releases/tag/1.4.2)] - UNSC Nereid (H2) - [2025-07-28] |
7 | 52 |
|
8 | 53 | ### Added and Fixed |
|
0 commit comments