This work is made available under the MIT License. To view a copy of this license, see the `LICENSE file <https://github.com/NVlabs/gbrl/blob/master/LICENSE>`__.
GBRL supports learning rate scheduling to control the learning rate throughout training. Two schedulers are available:

- **Constant** (default): Fixed learning rate throughout training
- **Linear**: Linearly interpolates between an initial and final learning rate

.. note::

   Linear scheduler on GPU is only supported for oblivious trees (``grow_policy='oblivious'``).

Constant Scheduler (Default)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. code-block:: python

   # Constant learning rate (default behavior)
   optimizer = {
       'algo': 'SGD',
       'lr': 0.1,  # Fixed learning rate
       'start_idx': 0,
       'stop_idx': out_dim
   }


Linear Scheduler
~~~~~~~~~~~~~~~~

The linear scheduler interpolates the learning rate from ``lr`` (initial) to ``stop_lr`` (final) over ``T`` trees:

.. math::

   lr_t = lr + \frac{t}{T} \times (stop\_lr - lr)

where :math:`t` is the current tree index (0-indexed, so :math:`t \in [0, T-1]`). The schedule covers trees 0 through T-1, and at tree T and beyond, the learning rate is held constant at ``stop_lr``. This means:

- At tree 0: :math:`lr_0 = lr` (initial learning rate)
- At tree T-1: :math:`lr_{T-1} = lr + \frac{T-1}{T} \times (stop\_lr - lr)` (approaching the final learning rate)
- At tree T and beyond: :math:`lr_t = stop\_lr` (held constant)

**Edge Case (T=1):** When ``T=1``, the schedule contains only tree 0, which uses ``lr`` (since :math:`lr_0 = lr + \frac{0}{1} \times (stop\_lr - lr) = lr`). The interpolation phase is skipped, so tree 1 and all subsequent trees immediately use ``stop_lr``.

**Parameter Constraints:**

- ``T`` must be a positive integer (minimum 1). It should equal the number of trees you expect to build.

- ``lr`` and ``stop_lr`` must be positive floats. ``stop_lr`` can be greater than ``lr`` for warming schedules.
- At tree T and for all subsequent trees, the scheduler holds at ``stop_lr``.

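As a sanity check, the schedule can be reproduced in plain Python (``linear_lr`` below is an illustrative helper, not part of the GBRL API):

.. code-block:: python

   def linear_lr(t: int, lr: float, stop_lr: float, T: int) -> float:
       """Learning rate applied to tree index t under the linear schedule."""
       if t >= T:
           return stop_lr  # held constant once the schedule is exhausted
       return lr + (t / T) * (stop_lr - lr)

   # Decay from 0.1 to 0.01 over T=100 trees
   print(linear_lr(0, 0.1, 0.01, 100))    # tree 0 starts at lr = 0.1
   print(linear_lr(100, 0.1, 0.01, 100))  # tree T onward holds stop_lr = 0.01

   # Edge case T=1: tree 0 uses lr, every later tree uses stop_lr
   print(linear_lr(0, 0.1, 0.01, 1))      # 0.1
   print(linear_lr(1, 0.1, 0.01, 1))      # 0.01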
.. code-block:: python

   # Linear learning rate decay from 0.1 to 0.01 over 100 trees
   optimizer = {
       'algo': 'SGD',
       'lr': 0.1,             # Initial learning rate
       'stop_lr': 0.01,       # Final learning rate
       'T': 100,              # Number of trees for the schedule
       'scheduler': 'Linear',
       'start_idx': 0,
       'stop_idx': out_dim
   }

   tree_struct = {
       'max_depth': 4,
       'n_bins': 256,
       'min_data_in_leaf': 0,
       'par_th': 2,
       'grow_policy': 'oblivious'  # Required for GPU linear scheduler
   }

   gbt_model = GBTModel(
       input_dim=input_dim,
       output_dim=out_dim,
       tree_struct=tree_struct,
       optimizers=optimizer,
       params=gbrl_params,
       verbose=1,
       device=device
   )


Monotonic Constraints
---------------------

Monotonic constraints enforce that the model output is monotonically increasing or decreasing with respect to specific input features. This is useful for incorporating domain knowledge or ensuring interpretable behavior.

.. note::

   Monotonic constraints are only supported for **oblivious trees** (``grow_policy='oblivious'``).
   Constraints apply to the output dimensions defined by ``start_idx`` to ``stop_idx-1`` in the
   optimizer configuration. For ``GBTModel``, this typically covers all outputs. For actor-critic
   models, constraints affect only the policy outputs (not value function outputs).

**How Constraints Are Enforced:**

Monotonic constraints are enforced through two mechanisms:

1. **During tree growing:** Incompatible splits are rejected or pruned to preserve monotonicity.
   The constraint-aware scoring function pools the left and right child means when a split
   would violate the monotonic ordering, effectively reducing the score of such splits.

2. **After each tree is built:** Gradient-based updates that would violate constraints are
   projected or clipped by the optimizer for the affected output indices (``start_idx`` to
   ``stop_idx-1``). A pool-adjacent-violators (PAVA) algorithm is applied to ensure leaf
   values respect the specified monotonic ordering.

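The PAVA step can be illustrated with a minimal pure-Python sketch of the standard algorithm (an illustration only, not GBRL's actual C++/CUDA implementation). Adjacent values that violate an increasing ordering are pooled and replaced by their mean:

.. code-block:: python

   def pava_increasing(values):
       """Smallest mean-pooling change that makes `values` non-decreasing."""
       blocks = []  # each block is [sum, count]
       for v in values:
           blocks.append([v, 1])
           # merge backwards while the previous block's mean exceeds this one's
           while (len(blocks) > 1 and
                  blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]):
               s, c = blocks.pop()
               blocks[-1][0] += s
               blocks[-1][1] += c
       out = []
       for s, c in blocks:
           out.extend([s / c] * c)
       return out

   print(pava_increasing([1.0, 3.0, 2.0, 4.0]))  # [1.0, 2.5, 2.5, 4.0]

A decreasing constraint is handled symmetrically, for example by negating the values, pooling, and negating back.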
**Practical Trade-offs:**

- Split search may be slower due to constraint checking and mean pooling during scoring
- Convergence may be affected for ``GBTModel`` and actor-critic models (policy outputs only)
  since some gradient directions are restricted
- The constraint projection ensures predictions are monotonic but may result in a suboptimal
  fit compared to unconstrained models

Setting Monotonic Constraints
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Constraints are specified as a dictionary mapping feature indices to constraint specifications:

.. code-block:: python

   constraints = {
       feature_index: (direction, output_indices),
       ...
   }

Where:

- ``feature_index``: The input feature to constrain (int or numpy integer type)
- ``direction``: The constraint direction, written in any of three equivalent forms:
  ``'increasing'``, ``'+'``, or ``1`` for increasing constraints, and
  ``'decreasing'``, ``'-'``, or ``-1`` for decreasing constraints.
- ``output_indices``: A single int or a list of output dimensions to apply the constraint to

Only specify the features you want to constrain; unlisted features are unconstrained.
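The three equivalent forms per direction can be reduced to a single sign with a small helper (``normalize_direction`` is a hypothetical illustration, not part of the GBRL API):

.. code-block:: python

   def normalize_direction(direction):
       """Map any accepted direction spec to +1 (increasing) or -1 (decreasing)."""
       if direction in ('increasing', '+', 1):
           return 1
       if direction in ('decreasing', '-', -1):
           return -1
       raise ValueError(f"unknown constraint direction: {direction!r}")

   print(normalize_direction('increasing'), normalize_direction('-'))  # 1 -1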
.. code-block:: python

   from gbrl.models import GBTModel

   input_dim = 4
   out_dim = 2

   # Feature 0: increasing for output 0
   # Feature 1: decreasing for outputs 0 and 1
   # Features 2-3: no constraints (not listed)
   monotonic_constraints = {
       0: ("increasing", 0),
       1: ("decreasing", [0, 1]),
   }

   tree_struct = {
       'max_depth': 4,
       'n_bins': 256,
       'min_data_in_leaf': 0,
       'par_th': 2,
       'grow_policy': 'oblivious'  # Required for monotonic constraints
   }

   # Specify which output dimensions to optimize
   # (constraints apply to indices start_idx to stop_idx-1)
   optimizer = {
       'algo': 'SGD',
       'lr': 0.1,
       'start_idx': 0,
       'stop_idx': out_dim
   }

   gbrl_params = {
       'split_score_func': 'Cosine',
       'generator_type': 'Quantile'
   }

   gbt_model = GBTModel(
       input_dim=input_dim,
       output_dim=out_dim,
       tree_struct=tree_struct,
       optimizers=optimizer,
       params=gbrl_params,
       monotonic_constraints=monotonic_constraints,
       verbose=1,
       device='cuda'  # GPU supported for oblivious trees
   )


Combining Schedulers and Constraints
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Monotonic constraints and linear schedulers can be used together:
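For example, merging the two configurations above (a sketch that reuses only settings shown earlier; pass these dictionaries to ``GBTModel`` exactly as in the previous examples):

.. code-block:: python

   out_dim = 2

   # One optimizer carries both the linear schedule and the constrained output range
   optimizer = {
       'algo': 'SGD',
       'lr': 0.1,             # Initial learning rate
       'stop_lr': 0.01,       # Final learning rate
       'T': 100,              # Number of trees for the schedule
       'scheduler': 'Linear',
       'start_idx': 0,
       'stop_idx': out_dim    # constraints apply to outputs 0 to stop_idx-1
   }

   tree_struct = {
       'max_depth': 4,
       'n_bins': 256,
       'min_data_in_leaf': 0,
       'par_th': 2,
       'grow_policy': 'oblivious'  # Required for both features
   }

   monotonic_constraints = {
       0: ("increasing", 0),
       1: ("decreasing", [0, 1]),
   }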