Skip to content

Commit d3f0823

Browse files
committed
update README
1 parent 7bf447e commit d3f0823

1 file changed

Lines changed: 68 additions & 61 deletions

File tree

README.md

Lines changed: 68 additions & 61 deletions
Original file line numberDiff line numberDiff line change
@@ -1,92 +1,99 @@
1-
# schneeberger-liver-transfusion
1+
# Immune cell dynamics deconvoluted by single-cell RNA sequencing in normothermic machine perfusion of the liver
22

3-
Analysis of scRNA-seq data for the Liver transfusion project by Stefan Schneeberger
3+
This repository contains a [nextflow](https://github.com/nextflow-io/nextflow/) workflow to reproduce the single-cell analysis of
44

5-
## Getting started
5+
> Hautz, Salcher, Fodor et al. (2022), Immune cell dynamics deconvoluted by single-cell RNA sequencing in normothermic machine perfusion of the liver.
66
7-
To make it easy for you to get started with GitLab, here's a list of recommended next steps.
7+
Raw sequencing data is availble from GEO ([GSE216584](www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE216584)).
8+
The preprocessed data and singularity containers required to run this workflow are available [from zenodo](https://doi.org/10.5281/zenodo.7249006). On zenodo, also the results (i.e. executed jupyter notebooks, plots, etc.) generated by this workflow can be downloaded.
89

9-
Already a pro? Just edit this README.md and make it your own. Want to make it easy? [Use the template at the bottom](#editing-this-readme)!
1010

11-
## Add your files
11+
## Launching the workflows
1212

13-
- [ ] [Create](https://docs.gitlab.com/ee/user/project/repository/web_editor.html#create-a-file) or [upload](https://docs.gitlab.com/ee/user/project/repository/web_editor.html#upload-a-file) files
14-
- [ ] [Add files using the command line](https://docs.gitlab.com/ee/gitlab-basics/add-file.html#add-a-file-using-the-command-line) or push an existing Git repository with the following command:
13+
### 1. Prerequisites
1514

16-
```
17-
cd existing_repo
18-
git remote add origin https://gitlab.i-med.ac.at/icbi-lab/data-analyses/schneeberger-liver-transfusion.git
19-
git branch -M main
20-
git push -uf origin main
21-
```
22-
23-
## Integrate with your tools
15+
* [Nextflow](https://www.nextflow.io/index.html#GetStarted), version 22.04.5 or higher
16+
* [Singularity/Apptainer](https://apptainer.org/), version 3.7 or higher (tested with 3.7.0-1.el7)
2417

25-
- [ ] [Set up project integrations](https://gitlab.i-med.ac.at/icbi-lab/data-analyses/schneeberger-liver-transfusion/-/settings/integrations)
18+
### 2. Obtain data
2619

27-
## Collaborate with your team
20+
Before launching the workflow, you need to obtain input data and singularity containers from zenodo.
21+
First of all, clone this repository:
2822

29-
- [ ] [Invite team members and collaborators](https://docs.gitlab.com/ee/user/project/members/)
30-
- [ ] [Create a new merge request](https://docs.gitlab.com/ee/user/project/merge_requests/creating_merge_requests.html)
31-
- [ ] [Automatically close issues from merge requests](https://docs.gitlab.com/ee/user/project/issues/managing_issues.html#closing-issues-automatically)
32-
- [ ] [Enable merge request approvals](https://docs.gitlab.com/ee/user/project/merge_requests/approvals/)
33-
- [ ] [Automatically merge when pipeline succeeds](https://docs.gitlab.com/ee/user/project/merge_requests/merge_when_pipeline_succeeds.html)
23+
```bash
24+
git clone https://github.com/icbi-lab/nmp-liver.git
25+
cd nmp-liver
26+
```
3427

35-
## Test and Deploy
28+
Then, within the repository, download the data archives and extract then to the corresponding directories:
3629

37-
Use the built-in continuous integration in GitLab.
30+
```bash
31+
# singularity containers
32+
wget "https://zenodo.org/record/7249006/files/containers.zip?download=1"
3833

39-
- [ ] [Get started with GitLab CI/CD](https://docs.gitlab.com/ee/ci/quick_start/index.html)
40-
- [ ] [Analyze your code for known vulnerabilities with Static Application Security Testing(SAST)](https://docs.gitlab.com/ee/user/application_security/sast/)
41-
- [ ] [Deploy to Kubernetes, Amazon EC2, or Amazon ECS using Auto Deploy](https://docs.gitlab.com/ee/topics/autodevops/requirements.html)
42-
- [ ] [Use pull-based deployments for improved Kubernetes management](https://docs.gitlab.com/ee/user/clusters/agent/)
43-
- [ ] [Set up protected environments](https://docs.gitlab.com/ee/ci/environments/protected_environments.html)
34+
# input data
35+
wget "https://zenodo.org/record/7249006/files/input_data.zip?download=1"
4436

45-
***
37+
unzip containers.zip
38+
unzip input_data.zip
39+
```
4640

47-
# Editing this README
41+
### 3. Configure nextflow
4842

49-
When you're ready to make this README your own, just edit this file and use the handy template below (or feel free to structure it however you want - this is just a starting point!). Thank you to [makeareadme.com](https://www.makeareadme.com/) for this template.
43+
Depending on your HPC/cloud setup you will need to adjust the nextflow profile in `nextflow.config`, to tell
44+
nextflow how to submit the jobs. Using a `withName:...` directive, special
45+
resources may be assigned to GPU-jobs. You can get an idea by checking out the `icbi_liver` profile - which we used to run the
46+
workflow on our on-premise cluster.
5047

51-
## Suggestions for a good README
52-
Every project is different, so consider which of these sections apply to yours. The sections used in the template are suggestions for most open source projects. Also keep in mind that while a README can be too long and detailed, too long is better than too short. If you think your README is too long, consider utilizing another form of documentation rather than cutting out information.
48+
### 4. Launch the workflows
5349

54-
## Name
55-
Choose a self-explaining name for your project.
50+
```bash
51+
nextflow run main.nf -resume -profile <YOUR_PROFILE> \
52+
--outdir "./data/results"
53+
```
5654

57-
## Description
58-
Let people know what your project can do specifically. Provide context and add a link to any reference visitors might be unfamiliar with. A list of Features or a Background subsection can also be added here. If there are alternatives to your project, this is a good place to list differentiating factors.
55+
## Structure of this repository
5956

60-
## Badges
61-
On some READMEs, you may see small images that convey metadata, such as whether or not all the tests are passing for the project. You can use Shields to add some to your README. Many services also have instructions for adding a badge.
57+
* `analyses`: Place for e.g. jupyter/rmarkdown notebooks, gropued by their respective (sub-)workflows.
58+
* `bin`: executable scripts called by the workflow
59+
* `conf`: nextflow configuration files for all processes
60+
* `containers`: place for singularity image files. Not part of the git repo and gets created by the download command.
61+
* `data`: place for input data and results in different subfolders. Gets populated by the download commands and by running the workflows.
62+
* `lib`: custom libraries and helper functions
63+
* `modules`: nextflow DSL2.0 modules
64+
* `tables`: contains static content that should be under version control (e.g. manually created tables)
6265

63-
## Visuals
64-
Depending on what you are making, it can be a good idea to include screenshots or even a video (you'll frequently see GIFs rather than actual videos). Tools like ttygif can help, but check out Asciinema for a more sophisticated method.
6566

66-
## Installation
67-
Within a particular ecosystem, there may be a common way of installing things, such as using Yarn, NuGet, or Homebrew. However, consider the possibility that whoever is reading your README is a novice and would like more guidance. Listing specific steps helps remove ambiguity and gets people to using your project as quickly as possible. If it only runs in a specific context like a particular programming language version or operating system or has dependencies that have to be installed manually, also add a Requirements subsection.
67+
## Workflow description
6868

69-
## Usage
70-
Use examples liberally, and show the expected output if you can. It's helpful to have inline the smallest example of usage that you can demonstrate, while providing links to more sophisticated examples if they are too long to reasonably include in the README.
69+
The analysis workflow comprises the followin steps:
70+
* QC of the unfiltered input data
71+
* Cell-type annotation
72+
* Pseudobulk generation and DE analysis with DESeq2
73+
* Subcluster analysis of Macrophages/Monocytes and Neutrophils
74+
* Comparison of timepoints T0 vs T1
7175

72-
## Support
73-
Tell people where they can go to for help. It can be any combination of an issue tracker, a chat room, an email address, etc.
76+
## Contact
7477

75-
## Roadmap
76-
If you have ideas for releases in the future, it is a good idea to list them in the README.
78+
For reproducibility issues or any other requests regarding single-cell data analysis, please use the [issue tracker](https://github.com/nmp-liver/issues). For anything else, you can reach out to the corresponding author(s) as indicated in the manuscript.
7779

78-
## Contributing
79-
State if you are open to contributions and what your requirements are for accepting them.
80+
## Notes on reproducibility
8081

81-
For people who want to make changes to your project, it's helpful to have some documentation on how to get started. Perhaps there is a script that they should run or some environment variables that they need to set. Make these steps explicit. These instructions could also be useful to your future self.
82+
We aimed at making this workflow reproducible by providing all input data, containerizing all software
83+
dependencies and integrating all analysis steps into a nextflow workflow.
84+
In theory, this allows to execute the workflow on any system that can run nextflow and singularity.
85+
Unfortunately, some single cell analysis algorithms (in particular scVI and UMAP) will yield
86+
slightly different results on different hardware, trading off computational reproducibility for a
87+
significantly faster runtime. In particular, results will differ when changing the number of cores, or
88+
when running on a CPU/GPU of a different architecture. See also https://github.com/scverse/scanpy/issues/2014 for a discussion.
8289

83-
You can also document commands to lint the code or run tests. These steps help to ensure high code quality and reduce the likelihood that the changes inadvertently break something. Having instructions for running tests is especially helpful if it requires external setup, such as starting a Selenium server for testing in a browser.
90+
Since the cell-type annotation depends on clustering, and the clustering depends on the neighborhood graph,
91+
which again depends on the scVI embedding, running the workflow on a different machine will likely break the cell-type labels.
8492

85-
## Authors and acknowledgment
86-
Show your appreciation to those who have contributed to the project.
93+
Below is the hardware we used to execute the workflow. Theoretically,
94+
any CPU/CPU of the same generation shoud produce identical results, but we did not have the chance to test this yet.
8795

88-
## License
89-
For open source projects, say how it is licensed.
96+
* Compute node CPU: `Intel(R) Xeon(R) CPU E5-2699A v4 @ 2.40GHz` (2x)
97+
* GPU node CPU: `EPYC 7352 24-Core` (2x)
98+
* GPU node GPU: `Nvidia Quadro RTX 8000 GPU`
9099

91-
## Project status
92-
If you have run out of energy or time for your project, put a note at the top of the README saying that development has slowed down or stopped completely. Someone may choose to fork your project or volunteer to step in as a maintainer or owner, allowing your project to keep going. You can also make an explicit request for maintainers.

0 commit comments

Comments
 (0)