Conversation
methods/catalog/genre/model.py
Outdated
    def get_counterfactuals(self, factuals: pd.DataFrame) -> pd.DataFrame:
        """
        Generate counterfactuals
The purpose of this method is to compute counterfactuals for the given factual inputs, not to act as a fake interface. The returned counterfactuals should be specific to the given factuals, not a fixed output for the entire X_test that artificially reproduces the results.
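For illustration, the expected interface shape might look like the following sketch. This is not the actual GenRe logic; the class name and the `generator` callable are placeholders standing in for the real generative sampling step:

```python
import pandas as pd

class GenReSketch:
    """Illustrative only: shows the expected per-input interface,
    not the real GenRe method."""

    def __init__(self, generator):
        # generator: a callable mapping one factual row to a
        # counterfactual row (placeholder for the real sampler)
        self.generator = generator

    def get_counterfactuals(self, factuals: pd.DataFrame) -> pd.DataFrame:
        # Compute one counterfactual per input row, so the result
        # depends on the given factuals rather than being a fixed
        # precomputed output for the whole X_test.
        rows = [self.generator(row) for _, row in factuals.iterrows()]
        return pd.DataFrame(rows, index=factuals.index)
```

The point is that the output is derived row-by-row from `factuals` and shares its index, so calling the method on a subset returns counterfactuals only for that subset.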
methods/catalog/genre/reproduce.py
Outdated
    # Initialize GenRe recourse module
    rec_module = GenReOriginal(
        pair_model=pair_model,
The recourse method should be able to work with ModelCatalog, which in turn works with DataCatalog; this implementation doesn't align with the repo structure.
methods/catalog/genre/reproduce.py
Outdated
    if __name__ == "__main__":
        main()
I tried to run this script, but it doesn't run successfully: it fails with a ModuleNotFoundError.
methods/catalog/genre/reproduce.py
Outdated
    # 5. Initialize GenRe using our wrapper
    print("Initializing GenRe...")
    genre = GenRe(
        mlmodel=ann_clf,  # Pass ANN directly
The recourse method still does not align with the repository’s ModelCatalog and DataCatalog structure. Even if the current implementation successfully reproduces the original paper’s results, it cannot yet operate within the benchmark pipeline. After reproduction, the method must be runnable on all existing datasets and models in the benchmark suite. In its current form, the benchmark pipeline would fail.
I have updated model and reproduce. The GenRe class can now handle both ModelCatalog and my model, and its implementation in reproduce.py will support the repo's structure if we use the commented block, which calls the existing functions to load the repo's data and model. Since GenRe's transformer and the black-box models were both trained on the author's dataset, switching to the repository's data and models would definitely cause it to fail. In a previous attempt to reproduce GenRe with the repo's models and dataset I struggled to get comparable results, so here I used the original data and models for better reproduction.
So, please add a test that performs a sanity check to ensure your implementation is compatible with the repository’s structure. There’s no need to check the reproducibility of results in this test; the goal is to confirm that the method will be able to run experiments later on other datasets and models in the repo.
methods/catalog/genre/reproduce.py
Outdated
        args.hf_repo, input_dim, device
    )

    # ========== FUTURE: Load repo's data and models ==========
The commented implementation doesn’t work. We need to make sure that the implemented recourse method can do two things: first, reproduce the reported results, even when using different datasets and models, and second, work with and align to the repository’s structure.
These checks can be done either separately or together, as long as the recourse method itself remains fixed.
…e Genre method. Add a function in reproduce.py to test compatibility (working on it). TODO: add data, add model if current ones don't work
… mlp to binary output; clean genre folder
I have added my data to DataCatalog following the link you gave me, and it seems to work!
zkhotanlou
left a comment
After addressing the comments, please add the results of your implemented method on the benchmark datasets and models to run_experiments.py and result.csv.
Please add the datasets specific to your implementation to that method's directory in methods/catalog/genre. We only need the benchmarking datasets in the ./data directory.
But I need this (and all the other stuff I added to data/catalog/) to access "my" data through DataCatalog, so that genre aligns with the repo's structure. I did this following the PR #7 you gave me.
I mean just the location of the added dataset: we need the datasets specific to your method to live in that method's directory.
But the data already exists in the genre directory; these changes are for accessing my data through DataCatalog. So I'll add it back.
This is an implementation of the "GenRe" [1] recourse method. The reproduction is at level 1: the unit tests check that the implementation reproduces the results reported in the paper for the Adult dataset with an MLP model, and also check compatibility with the repo's structure.

[1] Garg, P., Nagalapatti, L., & Sarawagi, S. (2025). From Search To Sampling: Generative Models For Robust Algorithmic Recourse. arXiv preprint arXiv:2505.07351.
I redid the GenRe implementation. This time I tried to use as much as possible from the author's repo for accurate reproduction, with just a few modifications to fit our repo. The GenRe models I trained on my Mac in my own environment (Python 3.12, a fairly recent PyTorch) achieved better reproduction results but are unfortunately incompatible with our Python 3.7 environment, and therefore unusable. T_T
The models are now hosted on Hugging Face (https://huggingface.co/jamie250/genrereproduce), and reproduce.py can fetch them. We need to add the huggingface-hub library to our environment to support fetching models from the Hub, so I modified requirements-dev.txt as well.
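For reference, the fetch could be a thin wrapper around `hf_hub_download` from huggingface-hub, which downloads a file once and then serves it from the local cache. The filename argument here is a placeholder, not the actual checkpoint name in the repo:

```python
from huggingface_hub import hf_hub_download

def fetch_genre_checkpoint(filename: str) -> str:
    # Download (or reuse the cached copy of) a checkpoint file from
    # the Hugging Face Hub and return its local filesystem path.
    # "filename" is a placeholder; use the real checkpoint name.
    return hf_hub_download(repo_id="jamie250/genrereproduce",
                           filename=filename)
```

reproduce.py can then load the returned path with torch.load, keeping the large model weights out of the git repository.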
Paper:
FROM SEARCH TO SAMPLING: GENERATIVE MODELS FOR ROBUST ALGORITHMIC RECOURSE
By Prateek Garg*, Lokesh Nagalapatti, Sunita Sarawagi
ICLR 2025
Results (Table 2, Adult dataset, mine vs. original):

    Metric   Mine   Original
    Cost     0.70   0.69
    Val      0.93   1.00
    LOF      0.90   0.98
    Score    1.78   1.93
Potential factors affecting the metrics:
1. Environment differences, especially the PyTorch version
2. Batch size for transformer training reduced from 8192 to 1024