End-to-end, reproducible code + text for the manuscript
“Paracetamol for Renal Recovery in Severe Malaria (PARIST Trial)”
- Project scope
- Folder structure
- Quick-start
- Re-running the full pipeline
- Continuous integration
- Authorship & contributions
- Data governance
- License & citation
- Contact
This repository is the single source of truth for:
- De-identified analysis datasets
- R scripts and R Markdown notebooks
- Auto-generated figures & tables
- The manuscript in markdown/LaTeX (rendered to PDF & HTML)
- GitHub Actions CI to guarantee that analyses and rendering are repeatable
Why a public workflow?
Releasing code and de-identified data (+ exact package versions viarenv) fulfils FAIR-data principles, improves transparency, and accelerates follow-up research.
parist-trial-workflow/
├── analysis/ # R scripts & notebooks
│ ├── 01\_data-cleaning.R
│ ├── 02\_descriptive.Rmd
│ ├── 03\_RMST\_analysis.R <- primary analysis
│ └── utils/plot\_theme.R
├── data/
│ ├── raw/ # 🔒 not in git (see .gitignore)
│ └── derived/ # de-identified CSVs shared openly
├── manuscript/
│ ├── PARIST\_manuscript.Rmd
│ ├── references.bib
│ └── journal-template.tex
├── figures/ # auto-generated PNG/PDF/SVG
├── docs/ # rendered manuscript (CI artifact / GitHub Pages)
├── archive/ # original Word files & approvals
├── .github/workflows/ci.yml
├── renv.lock # pinned R package versions
├── LICENSE | CITATION.cff
└── README.md # you are here
git clone https://github.com/<ORG>/parist-trial-workflow.git
cd parist-trial-workflow
# one-liner to restore exact packages
R -q -e "install.packages('renv'); renv::restore()"Rscript analysis/03_RMST_analysis.RR -q -e "rmarkdown::render('manuscript/PARIST_manuscript.Rmd',
output_dir = 'docs')"
open docs/PARIST_manuscript.pdf| Stage | Script / tool | Output |
|---|---|---|
| 1 Import & clean | 01_data-cleaning.R |
data/derived/parist_analysis_set.csv |
| 2 Descriptive stats | 02_descriptive.Rmd |
figures/Table1.png |
| 3 Primary RMST + adjusted models | 03_RMST_analysis.R |
figures/Fig2_RMST_curve.pdf, analysis/results/rmst_summary.csv |
The CI workflow executes exactly these steps on every push / PR to guarantee reproducibility.
.github/workflows/ci.yml:
- Checks out the repo.
- Restores the
renvcache. - Runs analysis scripts; fails if any exit non-zero.
- Renders the manuscript to PDF.
- Uploads artifacts & (optionally) deploys
docs/to GitHub Pages.
Status badge at the top of this README shows the latest build.
We adopt the CRediT taxonomy. Example (edit as needed):
| Author | Contribution(s) |
|---|---|
| A.B. | Conceptualization, Methodology, Writing – original draft |
| C.D. | Formal analysis, Software, Visualization |
| E.F. | Investigation, Data curation |
| G.H. | Supervision, Funding acquisition |
➡️ How to contribute:
- Create a feature branch:
git checkout -b feat/<slug> - Commit with Conventional Commit prefix (
feat:,fix:,docs:…) - Open a Pull Request; CI must be green & 2 reviews obtained.
- Squash-merge to
main.
| Data tier | Location | Policy |
|---|---|---|
| Raw identifiable | Institution file-share only | Never pushed to GitHub |
| Derived de-identified | data/derived/ |
Commit OK; shared under CC-BY 4.0 |
| Post-acceptance | Zenodo / Mendeley Data | DOI back-linked here |
Corresponding author stores the encryption key for raw ↔︎ derived mapping.
- Code: MIT License – see
LICENSE. - Text & figures: Creative Commons CC-BY 4.0.
A ready-made CITATION.cff lets GitHub generate a citation snippet.
When citing, please use the DOI badge at the top of this README once minted.
Corresponding author: Dr Paasi George – [georgepaasi8@gmail.com)
For issues or feature requests open a GitHub Issue or start a Discussion.
Happy analysing – and thanks for advancing open, reproducible clinical science!