Merged
Collaborator
That's a good implementation. Please add the reproduce tests as well.
amirhk
requested changes
Oct 20, 2025
Collaborator
amirhk
left a comment
some high-level feedback/requests
cc @zkhotanlou for detailed implementation review
@HashirA123 agree with Zahra that the next step is reproduce.py
zkhotanlou
reviewed
Oct 20, 2025
Initially had the PROBE model commits inside this branch. They have now been moved to their own branch, and this branch now only contains the RBR model commits.
(WIP) Since the method is implemented, the main thing left is making the data processing and experiment process match the original paper. They do use the same datasets but process them slightly differently (e.g., using different features). Their model (an MLP) is also different. So running with the dataset and model as we currently have them will definitely not reproduce their results. We will have to think about how best to align with their setup. One option is to skip our model and data catalogs for loading the data and simply port over their model and data creation/processing code.
zkhotanlou
reviewed
Oct 26, 2025
Ran through the code to get the method to simply run. More work is needed to confirm correctness of results. WIP commit with debug prints and small fixes.
Getting the reproduce for this method working was a bit tricky. The main challenge was making sure that the dataset and models are processed and trained correctly. I have tried my best to build the model just as they did and to process the dataset the same way. Although the results are not identical, they can be classified as at least level 1 on the reproduction scale and can definitely be improved in the near future. The method does in fact work well at finding robust recourse, and some metrics are in line with the results of the paper.
zkhotanlou
reviewed
Nov 20, 2025
I worked to resolve several issues, mainly stemming from the predict function I implemented in the rbr_loss script. The method results in the reproduce file now fairly closely align with those from the original authors' code. The method does seem to rely heavily on the quality of the trained model for its recourse finding: if the model is very poor, the method may not be able to find recourse at all. Overall, in terms of reproduction, I believe this can be marked between 1 and 2.
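For illustration only (this is not the repository's actual rbr_loss code): a common source of bugs in a predict function like the one mentioned above is returning raw logits instead of class probabilities, which silently skews any recourse search that thresholds on probability. A minimal sketch, assuming a small NumPy MLP with hypothetical weights:

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax over the last axis."""
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def predict_proba(weights, biases, x):
    """Forward pass of a small ReLU MLP (hypothetical layout).
    Returning softmax probabilities here, rather than the raw logits,
    is the kind of detail a recourse loss depends on."""
    h = x
    for W, b in zip(weights[:-1], biases[:-1]):
        h = np.maximum(h @ W + b, 0.0)  # ReLU hidden layers
    logits = h @ weights[-1] + biases[-1]
    return softmax(logits)

# Tiny usage example with made-up weights: two inputs, one hidden
# layer of two units, two output classes.
W1, b1 = np.array([[1.0, -1.0], [0.5, 0.5]]), np.zeros(2)
W2, b2 = np.eye(2), np.zeros(2)
p = predict_proba([W1, W2], [b1, b2], np.array([[1.0, 2.0]]))
```

The sketch only shows the shape of the contract (probabilities in [0, 1] summing to 1 per row); the actual fix in this PR lives in rbr_loss.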
zkhotanlou
reviewed
Nov 22, 2025
Collaborator
zkhotanlou
left a comment
This is an implementation of the "RBR" [1] recourse method. The level of reproduction is level 1, as the unit tests check that the implementation can reproduce results reported in the paper for the German dataset on a neural network.
[1] Nguyen, Tuan-Duy Hien, Ngoc Bui, Duy Nguyen, Man-Chung Yue, and Viet Anh Nguyen. 2022. "Robust Bayesian Recourse." (UAI 2022)
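The level-1 check described above can be sketched as a tolerance comparison between reproduced metrics and the values reported in the paper. This is an illustrative sketch only; the metric names, values, and tolerance below are hypothetical, not the repository's actual unit test:

```python
def within_tolerance(reproduced, reported, rel_tol=0.10):
    """Level-1-style check: every reproduced metric must fall within a
    relative tolerance of the value reported in the paper.
    Metric names and tolerance are illustrative placeholders."""
    return all(
        abs(reproduced[k] - reported[k]) <= rel_tol * abs(reported[k])
        for k in reported
    )

# Hypothetical numbers for a German-dataset / neural-network setting
reported = {"validity": 1.00, "cost": 0.85}
reproduced = {"validity": 0.97, "cost": 0.89}
print(within_tolerance(reproduced, reported))  # → True for these values
```

A relative tolerance (rather than exact equality) fits the level-1 claim: results are close to the paper's but not identical, as the review comments note.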
The reproduction is functional and can be marked as level 1 on the reproduction scale. I believe the only metric that does not line up with the original paper/code is "current validity", which in the reproduction comes out consistently lower than it should.
I believe the method is implemented correctly, so the most probable cause of the difference is the training of the base model.