|
| 1 | +# salmon 2.0.1 |
| 2 | + |
| 3 | +A patch release with two internal accuracy improvements to selective-alignment |
| 4 | +quantification. **No breaking changes, no new options, and no output-format |
| 5 | +changes** — indices built with 2.0.0 work unchanged (no rebuild needed), and |
| 6 | +`quant.sf` / inferential-replicate outputs are produced exactly as before. |
| 7 | + |
| 8 | +## What changed |
| 9 | + |
| 10 | +Both changes refine how fragments are weighted across **multimapping** |
| 11 | +transcripts (paralogs and alternative isoforms), bringing the estimates closer to |
| 12 | +C++ salmon's: |
| 13 | + |
| 14 | +- **Fragment-length probability in the equivalence-class weights.** The |
| 15 | + selective-alignment equivalence-class weights now include the fragment-length |
| 16 | + term (salmon's `logFragProb`), which had been omitted. Without it the |
| 17 | + conditional weights were flattened for transcripts whose implied insert size |
| 18 | + differs, slightly degrading how shared reads are split. |
| 19 | +- **Abundance-aware fragment-length-distribution training.** The fragment-length |
| 20 | + distribution is now trained from fragments sampled by their abundance-aware |
| 21 | + posterior (matching salmon), so reads shared between near-duplicate transcripts |
| 22 | + contribute the dominant transcript's implied length. This concentrates the |
| 23 | + learned distribution as C++ salmon does. |
| 24 | + |
| 25 | +## Impact |
| 26 | + |
| 27 | +On benchmark data the per-transcript agreement with C++ salmon improves, and the |
| 28 | +remaining difference sits at the run-to-run noise floor: |
| 29 | + |
| 30 | +- Real data (GEUVADIS `ERR188044`, GRCh38 cDNA): per-transcript Spearman vs C++ |
| 31 | + salmon 0.9705 → **0.9710**; NumReads Pearson **0.9996**; mapping rate |
| 32 | + unchanged. |
| 33 | +- Simulated ground truth (human, polyester): the per-transcript Spearman gap to |
| 34 | + C++ salmon shrinks to ≤ 0.002 across difficulty levels and both the default |
| 35 | + (VBEM) and `--useEM` optimizers — at/near the noise floor. |
| 36 | + |
| 37 | +Performance and memory are unchanged, and runs remain reproducible (run-to-run |
| 38 | +variation is unaffected). Existing 2.0.0 results are still valid; re-quantifying |
| 39 | +picks up the slightly improved estimates. |
| 40 | + |
| 41 | +## Installation |
| 42 | + |
| 43 | +Unchanged from 2.0.0: |
| 44 | + |
| 45 | +```sh |
| 46 | +curl --proto '=https' --tlsv1.2 -LsSf \ |
| 47 | + https://github.com/COMBINE-lab/salmon/releases/latest/download/salmon-cli-installer.sh | sh |
| 48 | +# or: cargo install salmon-cli | conda install -c bioconda -c conda-forge salmon |
| 49 | +``` |
| 50 | + |
| 51 | +## Documentation |
| 52 | + |
| 53 | +Full docs at **<https://combine-lab.github.io/salmon>**. |
0 commit comments