yuw444
diff --git a/‎README.md‎
Lines changed: 72 additions & 52 deletions b/‎README.md‎
Lines changed: 72 additions & 52 deletions
diff --git a/‎docs/authors.html‎
Lines changed: 2 additions & 2 deletions b/‎docs/authors.html‎
Lines changed: 2 additions & 2 deletions
@@ -1,22 +1,40 @@
 # Rforce
 
-* We here to introduce [Rforce:Random Forests for Composite Endpoints](https://onlinelibrary.wiley.com/doi/10.1002/sim.70413), consisting of non-fatal composite events and terminal events. It utilizes generalized estimating equations(GEE) to build trees and handles the dependent censoring due to the terminal events with the concept of pseudo-at-risk duration. 
+**Rforce** implements the methodology described in [Rforce: Random Forests for Composite Endpoints](https://onlinelibrary.wiley.com/doi/10.1002/sim.70413), which models composite endpoints consisting of **non-fatal events and terminal events**.
 
-* This work has been awared as one of 4 receipents in American statistical Association [2024 Student Paper Competition](https://community.amstat.org/jointscsg-section/awards/student-paper-competition) (Section on Statistical Computing and Section on Statistical Graphics). It is now publised in Statistics in Medicine, PMID: [41640374](https://pubmed.ncbi.nlm.nih.gov/41640374/) DOI: [10.1002/sim.70413](https://onlinelibrary.wiley.com/doi/10.1002/sim.70413).
+The method builds random forests using **generalized estimating equations (GEE)** and handles dependent censoring caused by terminal events using the concept of **pseudo-at-risk duration**.
 
-* It not only gives the methodological soundness, but also offers both `R API` and `C API`, provides both computing and memory efficiency, enables parallel mechanism through [OpenMP](https://www.openmp.org/) while ensure the [reproducibility](https://yuw444.github.io/Rforce/articles/get-started.html#in-1-equivalent-command). 
+This work received the **2024 Student Paper Competition Award** from the American Statistical Association (ASA), jointly from the [Section on Statistical Computing and Section on Statistical Graphics](https://community.amstat.org/jointscsg-section/awards/student-paper-competition).
 
+The paper is published in *Statistics in Medicine*:
+
+- PMID: [41640374](https://pubmed.ncbi.nlm.nih.gov/41640374/)
+- DOI: [10.1002/sim.70413](https://onlinelibrary.wiley.com/doi/10.1002/sim.70413)
+
+The software provides both:
+
+- **R API**
+- **C API**
+
+Key features include:
+
+- High computational and memory efficiency
+- Parallel computation using [OpenMP](https://www.openmp.org/)
+- Reproducible results (see the reproducibility example [here](https://yuw444.github.io/Rforce/articles/get-started.html#in-1-equivalent-command))
 
 ---
-## Installation
 
-### Dependecy
+# Installation
+
+## Dependencies
 
-* `cmake>=3.16.0`: compile tool for `C API`
-* `OpenMP`: enable the parallel mechanism
-* `R>=4.3.3`: enable interaction with `R`
+- `cmake >= 3.16.0` – build system for the C API
+- `OpenMP` – parallel computing
+- `R >= 4.3.3` – R interface
 
-### R API
+---
+
+## Install R API
 
 ```r
 # install.packages("devtools")
@@ -25,7 +43,7 @@ devtools::install_github("yuw444/Rforce")
 
 ### C API
 
-```
+```bash
 git clone https://github.com/yuw444/Rforce.git
 cd Rforce
 mkdir build
@@ -34,16 +52,18 @@ cmake ..
 make
 ```
 
-A `CMakeLists.txt` file is provided in the repository
+A `CMakeLists.txt` file is provided in the repository.
 
 ---
+
 ## Usage
 
 ### R Examples
 
-* Examples: [Get Started](https://yuw444.github.io/Rforce/articles/get-started.html). 
+* Examples: [Get Started](https://yuw444.github.io/Rforce/articles/get-started.html).
 
 ---
+
 ### Shell Scripts
 
 ```bash
@@ -70,39 +90,39 @@ Rforce train <options>
 
 **Options:**
 
-| Option | Description | Required/Optional | Default |
-|:-----------------------------|:------------|:------------------|:--------|
-| `-d, --designMatrixY=<str>` | Path to design matrix | **Required** | |
-| `-a, --auxiliary=<str>` | Path to auxiliary features | **Required** | |
-| `-u, --unitsOfCPIU=<str>` | Path to unitsOfCPIU file | **Required** | |
-| `-o, --out=<str>` | Path to output directory | Optional | Current working directory |
-| `-v, --verbose=<int>` | Verbosity level (0–3) | Optional | 0 |
-| `-m, --maxDepth=<int>` | Maximum tree depth | Optional | 10 |
-| `-n, --minNodeSize=<int>` | Minimum node size | Optional | 2 × len(unitsOfCPIU) - 1 |
-| `-g, --gain=<float>` | Minimum gain for split | Optional | 0.0 (likelihood-based) or 1.3 (GEE-based) |
-| `-t, --mtry=<int>` | Number of variables to try during splitting | Optional | √(number of variables) |
-| `-s, --nsplits=<int>` | Number of splits to try per variable | Optional | 10 |
-| `-r, --nTrees=<int>` | Number of trees | Optional | 200 |
-| `-e, --seed=<int>` | Random seed | Optional | 926 |
-| `-p, --nPerms=<int>` | Number of permutations for variable importance | Optional | 10 |
-| `-u, --nVars=<int>` | Number of variables in the design matrix | Optional | Number of columns |
-| `-i, --pathVarIds=<str>` | Variable IDs (categorical variables supported via repeated IDs) | Optional | |
-| `-x, --iDot` | Output tree DOT files | Optional | False |
-| `-k, --k=<int>` | Bayesian estimator parameter for leaf output | Optional | 4 |
-| `-L, --long` | Use multiple rows per patient (RF-SLAM style) | Optional | |
-| `-N, --nopseudo` | Do not estimate pseudo risk time | Optional | |
-| `-P, --pseudorisk1` | Use original pseudo-risk time (population level) | Optional | |
-| `-B, --pseudorisk2` | Recalculate pseudo-risk time at each tree (default) | Optional | |
-| `-D, --dynamicrisk` | Dynamically estimate pseudo-risk time at each split | Optional | |
-| `-F, --nophi` | Fix φ = 1, do not estimate φ | Optional | |
-| `-P, --phi1` | Estimate φ at population level | Optional | |
-| `-H, --phi2` | Estimate φ at tree level (default) | Optional | |
-| `-Y, --dynamicphi` | Dynamically estimate φ at each split | Optional | |
-| `-G, --gee` | Use GEE approach | Optional | |
-| `-A, --padjust=<str>` | p-value adjustment method (`bonferroni`, `holm`, `hochberg`, `hommel`, `BH`, `BY`, `none`) | Optional | `BH` |
-| `-I, --interaction` | Add interaction terms for GEE | Optional | NULL |
-| `-S, --asym` | Use asymptotic approach | Optional | |
-| `-T, --threads=<int>` | Number of parallel computing threads | Optional | 8 |
+| Option                        | Description                              | Required/Optional | Default                          |
+|-------------------------------|------------------------------------------|-------------------|----------------------------------|
+| `-d, --designMatrixY=<str>`   | Path to design matrix                   | **Required**      |                                  |
+| `-a, --auxiliary=<str>`       | Path to auxiliary features              | **Required**      |                                  |
+| `-u, --unitsOfCPIU=<str>`     | Path to unitsOfCPIU file                | **Required**      |                                  |
+| `-o, --out=<str>`             | Path to output directory                | Optional          | Current working directory        |
+| `-v, --verbose=<int>`         | Verbosity level (0–3)                   | Optional          | 0                                |
+| `-m, --maxDepth=<int>`        | Maximum tree depth                      | Optional          | 10                               |
+| `-n, --minNodeSize=<int>`     | Minimum node size                       | Optional          | 2 × len(unitsOfCPIU) - 1         |
+| `-g, --gain=<float>`          | Minimum gain for split                  | Optional          | 0.0 (likelihood-based) or 1.3 (GEE-based) |
+| `-t, --mtry=<int>`            | Number of variables to try during splitting | Optional      | √(number of variables)           |
+| `-s, --nsplits=<int>`         | Number of splits to try per variable    | Optional          | 10                               |
+| `-r, --nTrees=<int>`          | Number of trees                         | Optional          | 200                              |
+| `-e, --seed=<int>`            | Random seed                             | Optional          | 926                              |
+| `-p, --nPerms=<int>`          | Number of permutations for variable importance | Optional  | 10                               |
+| `-u, --nVars=<int>`           | Number of variables in the design matrix | Optional         | Number of columns                |
+| `-i, --pathVarIds=<str>`      | Variable IDs (categorical variables supported via repeated IDs) | Optional | |
+| `-x, --iDot`                  | Output tree DOT files                   | Optional          | False                            |
+| `-k, --k=<int>`               | Bayesian estimator parameter for leaf output | Optional      | 4                                |
+| `-L, --long`                  | Use multiple rows per patient (RF-SLAM style) | Optional      |                                  |
+| `-N, --nopseudo`              | Do not estimate pseudo risk time        | Optional          |                                  |
+| `-P, --pseudorisk1`           | Use original pseudo-risk time (population level) | Optional | |
+| `-B, --pseudorisk2`           | Recalculate pseudo-risk time at each tree (default) | Optional | |
+| `-D, --dynamicrisk`           | Dynamically estimate pseudo-risk time at each split | Optional | |
+| `-F, --nophi`                 | Fix φ = 1, do not estimate φ            | Optional          |                                  |
+| `-P, --phi1`                  | Estimate φ at population level          | Optional          |                                  |
+| `-H, --phi2`                  | Estimate φ at tree level (default)      | Optional          |                                  |
+| `-Y, --dynamicphi`            | Dynamically estimate φ at each split    | Optional          |                                  |
+| `-G, --gee`                   | Use GEE approach                        | Optional          |                                  |
+| `-A, --padjust=<str>`         | p-value adjustment method (`bonferroni`, `holm`, `hochberg`, `hommel`, `BH`, `BY`, `none`) | Optional | `BH` |
+| `-I, --interaction`           | Add interaction terms for GEE           | Optional          | NULL                             |
+| `-S, --asym`                  | Use asymptotic approach                 | Optional          |                                  |
+| `-T, --threads=<int>`         | Number of parallel computing threads    | Optional          | 8                                |
 
 ---
 
@@ -116,11 +136,11 @@ Rforce predict <options>
 
 **Options:**
 
-| Option | Description | Required/Optional | Default |
-|:-------------------|:------------|:------------------|:--------|
-| `-m, --model=<str>` | Path to trained model | **Required** | |
-| `-t, --test=<str>` | Path to test data | **Required** | |
-| `-o, --out=<str>` | Path to output directory | Optional | Current working directory |
+| Option                | Description               | Required/Optional | Default                   |
+|-----------------------|---------------------------|-------------------|---------------------------|
+| `-m, --model=<str>`   | Path to trained model     | **Required**      |                           |
+| `-t, --test=<str>`    | Path to test data         | **Required**      |                           |
+| `-o, --out=<str>`     | Path to output directory  | Optional          | Current working directory |
 
 ---
 
@@ -146,8 +166,8 @@ Rforce predict -m output_folder/model.rforce -t test_data.csv -o prediction_resu
 - Dynamic options (`--dynamicrisk`, `--dynamicphi`) allow estimates at each split for more flexibility.
 - Parallel computation is supported via the `--threads` option.
 - GEE-based splitting with p-value adjustment is available.
-- An R API is currently actively developing which includes
-  - Classical survivial data generation
+- An R API is currently actively developing which includes:
+  - Classical survival data generation
   - Composite endpoint data generation
   - [`Wcompo`](https://cran.r-project.org/web/packages/Wcompo/index.html) methodology realization
   - An R interface to **Rforce**