Skip to content

Update usage docu for newcomers#1037

Merged
d4straub merged 26 commits into
nf-core:devfrom
d4straub:update-docu-for-newcomers
Jun 10, 2026
Merged

Update usage docu for newcomers#1037
d4straub merged 26 commits into
nf-core:devfrom
d4straub:update-docu-for-newcomers

Conversation

@d4straub

@d4straub d4straub commented May 19, 2026

Copy link
Copy Markdown
Collaborator

This is a first proposal to make the usage documentation more friendly to scientists new to the field. It attempts to explain the defaults of the pipeline and when to deviate from them.
Up for discussion.

PR checklist

  • This comment contains a description of changes (with reason).
  • If you've fixed a bug or added code that should be tested, add tests!
  • If you've added a new tool - have you followed the pipeline conventions in the contribution docs
  • If necessary, also make a PR on the nf-core/mag branch on the nf-core/test-datasets repository.
  • Make sure your code lints (nf-core pipelines lint).
  • Ensure the test suite passes (nextflow run . -profile test,docker --outdir <OUTDIR>).
  • Check for unexpected warnings in debug mode (nextflow run . -profile debug,test,docker --outdir <OUTDIR>).
  • Usage Documentation in docs/usage.md is updated.
  • Output Documentation in docs/output.md is updated.
  • CHANGELOG.md is updated.
  • README.md is updated (including new tool citations and authors/contributors).

@jfy133

jfy133 commented May 27, 2026

Copy link
Copy Markdown
Member

@nf-core-bot fix linting

@jfy133 jfy133 linked an issue May 27, 2026 that may be closed by this pull request
@jfy133 jfy133 marked this pull request as ready for review June 2, 2026 09:38
Comment thread docs/usage/new_to_mag.md Outdated
Comment thread docs/usage/new_to_mag.md Outdated
Comment thread docs/usage/new_to_mag.md Outdated
jfy133 and others added 3 commits June 2, 2026 15:13
Co-authored-by: Diego Alvarez S. <dialvarezs@gmail.com>
Co-authored-by: Diego Alvarez S. <dialvarezs@gmail.com>
Comment thread docs/usage/new_to_mag.md Outdated
Comment thread docs/usage/new_to_mag.md Outdated
Comment thread docs/usage/new_to_mag.md Outdated
Comment thread docs/usage/new_to_mag.md Outdated
Comment thread docs/usage/new_to_mag.md Outdated
Comment thread docs/usage/new_to_mag.md Outdated
Comment thread docs/usage/new_to_mag.md Outdated
Comment thread docs/usage/new_to_mag.md Outdated
jfy133 and others added 3 commits June 4, 2026 09:24
@jfy133 jfy133 requested a review from dialvarezs June 4, 2026 08:50

@d4straub d4straub left a comment

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think looks pretty good already.

Comment thread docs/usage/new_to_mag.md Outdated
Comment thread docs/usage/new_to_mag.md Outdated
Comment thread docs/usage/new_to_mag.md
With high-depth long reads, long-read assembly typically yields more coherent results ([Agustinho et al. 2024](https://doi.org/10.1038/s41592-024-02262-1)).
Short-read-first assembly performs better with high-depth short reads or low-quality long reads and produces more fragmented but higher-accuracy assemblies ([Overholt et al. 2020](https://doi.org/10.1111/1462-2920.15186), [Meyer et al. 2022](https://doi.org/10.1038/s41592-022-01431-4)).

**Recommendation**: run as many assemblers as computationally feasible.

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think it would be helpful to add here approximate RAM and time requirements. Same for binners. Maybe we can add numbers that we produce for the manuscrtipt review here? Alternatively, a student of mine is working on checking computational requirements and we could add that numbers when they are available.

@jfy133 jfy133 Jun 5, 2026

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm always loathe to do make such things outside a dedicated study (which I really REALLY want to see, but few do) as it can be so highly dependent on the data and complexity input... but yes I guess we could add it from the stuff @dialvarezs is adding.

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I guess we can with that provide read counts, but I assume the CAMI stuff is static? And then what metric of metagenome complexity can we use?

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

More context: I had this problem with a taxprofiler paper we just submitted, the only two metrics we could really report on were RAM and harddrive usagee...

Number of CPUs you can choose, you can set to 1 or 1000 it depends on how much is available, and time also depends many different factors: how old the CPU is, what type it is, how busy the scheduler is, what the IO bandwidth is, which are all driven by the infrastructure not the pipeline/tools themselves

@d4straub d4straub Jun 5, 2026

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would add this to give an approximation, i.e. does the specific assembler need 1 hour or 1 day, what tool needs longer? Is the RAM 10GB, 100GB, or 1TB? Number of CPUs kept on default. And thats it. After all its for people who are new to this to give an idea what to expect. We have that questions occationally in slack afaik.
And also how computational requirements relate to each other. I assume that in many cases the RAM and time ratios between alternative tools hold true, e.g. when one needs 10GB RAM and another needs 100GB, then the latter will usually need 10x more than the former. And that would be helpful in itself as well, I think.

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

But we can also postpone that and attempt to add it in a separate PR? No need to complicate things?

Comment thread docs/usage/new_to_mag.md Outdated
Comment thread docs/usage/new_to_mag.md Outdated
@d4straub

d4straub commented Jun 8, 2026

Copy link
Copy Markdown
Collaborator Author

So how its going here? Do we wait for more opinions?

@jfy133 jfy133 self-requested a review June 10, 2026 06:58

@jfy133 jfy133 left a comment

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Apparently I dreamt I merged this 🤦

Comment thread CHANGELOG.md Outdated
@d4straub

Copy link
Copy Markdown
Collaborator Author

I can merge it bypassing merging rules (because confirm-pass wont pass ever)

@d4straub d4straub merged commit e8eb3e1 into nf-core:dev Jun 10, 2026
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Create newcomer guidance documentation page

4 participants