Dev by IamMuhammadZeeshan · Pull Request #839 · kokabsc/gwkokab

IamMuhammadZeeshan · 2026-06-02T22:02:49Z

Summary

Related To

Description

Additional Notes (Optional)

…ions (#836)

* feat: implement HDF5 support for saving inference data and configurations
* fix: convert HDF5 dataset attributes to a dictionary
* Add report generation functionality with Papermill and Jupyter Notebook template
  - Introduced `generate_report.py` to handle report generation from input data files.
  - Integrated Papermill for executing Jupyter Notebook templates and generating HTML reports.
  - Added a new Jupyter Notebook template `template_report.ipynb` for report formatting.
  - Updated `pyproject.toml` to include new dependencies: `nbconvert`, `papermill`, and `plotly`.
  - Registered new command line entry point for report generation in `pyproject.toml`.
  - Included the notebook template in package data for distribution.
* Update src/gwkokab/analysis/report/generate_report.py
  Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
* style: format output notebook path assignment
* feat: add corner library dependency for enhanced plotting capabilities
  Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
* fix: ensure compatibility with JAX Array in HDF5 write function
  Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
* feat: using file descriptor to reduce IO overhead

---------

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

… and `SubPopulationModelCore` (#837)

* refactor: rename parameters for consistency in `MultiSourceModelCore` and `SubPopulationModelCore`
* Update src/gwkokab/analysis/multisource/common.py
  Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
* fix: reorder parameters in `MultiSourceModelCore`

---------

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

* feat: utils to repack and replace `.h5` and `.hdf5` files
* fix: remove bugs from option parsing and update epilog

…#833) Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

gemini-code-assist

Code Review

This pull request transitions the storage of inference results, configurations, and metadata from plain text and JSON files to a unified HDF5 format (inference_data.hdf5), updating both FlowMC and NumPyro samplers accordingly. It also introduces an automated HTML report generation tool using a Jupyter notebook template, standardizes parameter naming across hybrid models, and adds an HDF5 repacking script. The review feedback highlights several critical issues: a potential crash in generate_marginal_probs when inspecting classes without an explicit __init__, a bug in write_to_hdf5 where JAX Arrays are not converted to NumPy arrays during dataset overwrites, an argument parsing error in h5repack.py that prevents the file argument from being matched, and a lack of robustness in the report notebook when certain datasets are missing from the HDF5 file.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

gemini-code-assist · 2026-06-02T22:05:27Z

+        if data is not None:
+            if dataset_path in f:
+                del f[dataset_path]
+            elif isinstance(data, Array):
+                data = np.asarray(data)
+            obj = f.create_dataset(dataset_path, data=data)


In write_to_hdf5, the elif isinstance(data, Array): block is skipped if dataset_path in f is True. This means JAX Arrays are not converted to NumPy arrays when overwriting an existing dataset, which can cause h5py to fail or behave unexpectedly. Changing elif to if ensures the conversion always runs.

Suggested change

-        if data is not None:
-            if dataset_path in f:
-                del f[dataset_path]
-            elif isinstance(data, Array):
-                data = np.asarray(data)
-            obj = f.create_dataset(dataset_path, data=data)
+        if data is not None:
+            if dataset_path in f:
+                del f[dataset_path]
+            if isinstance(data, Array):
+                data = np.asarray(data)
+            obj = f.create_dataset(dataset_path, data=data)
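
The control-flow difference behind this suggestion can be reproduced without h5py or JAX. The following is a minimal sketch, assuming a plain `dict` as a stand-in for the open HDF5 file and a hypothetical `DeviceArray` class as a stand-in for a JAX `Array`; `to_host`, `write_buggy`, and `write_fixed` are illustrative names, not functions from the repository.

```python
class DeviceArray:
    """Hypothetical stand-in for a JAX Array; not the real type."""

    def __init__(self, values):
        self.values = list(values)


def to_host(data):
    """Stand-in for np.asarray: unwrap the wrapper into a plain list."""
    return data.values


def write_buggy(store, path, data):
    # Mirrors the original branch structure: the conversion sits in an
    # `elif`, so it is skipped whenever the path already exists.
    if data is not None:
        if path in store:
            del store[path]
        elif isinstance(data, DeviceArray):
            data = to_host(data)
        store[path] = data


def write_fixed(store, path, data):
    # Mirrors the suggested change: deletion and conversion are independent.
    if data is not None:
        if path in store:
            del store[path]
        if isinstance(data, DeviceArray):
            data = to_host(data)
        store[path] = data


buggy, fixed = {}, {}
for _ in range(2):  # the second pass overwrites the existing entry
    write_buggy(buggy, "/samples", DeviceArray([1.0, 2.0]))
    write_fixed(fixed, "/samples", DeviceArray([1.0, 2.0]))

print(type(buggy["/samples"]).__name__)  # unconverted wrapper after overwrite
print(type(fixed["/samples"]).__name__)  # converted value
```

On the first write both variants convert the value; only the overwrite exposes the difference, which is why the issue is easy to miss in a single-run test.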

gemini-code-assist · 2026-06-02T22:05:27Z

+    param_names = list(inspect.signature(model_meta_cls.__init__).parameters.keys())  # type: ignore
+    param_names.remove("self")


Using inspect.signature(model_meta_cls.__init__) and then param_names.remove("self") will crash if model_meta_cls does not explicitly define __init__ (like EccentricityMattersCore), because object.__init__ has no self parameter in its signature (or raises ValueError). Using inspect.signature(model_meta_cls) is the standard, safe way to get the constructor signature without self and works correctly for all classes.

Suggested change

-    param_names = list(inspect.signature(model_meta_cls.__init__).parameters.keys())  # type: ignore
-    param_names.remove("self")
+    param_names = list(inspect.signature(model_meta_cls).parameters.keys())
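
As a small illustration of the suggested call, the sketch below applies `inspect.signature` to the class object itself rather than to `__init__`. The classes `WithInit` and `WithoutInit` and the helper `constructor_param_names` are hypothetical and stand in for the model classes in the repository.

```python
import inspect


class WithInit:
    """Hypothetical model class with an explicit constructor."""

    def __init__(self, alpha, beta=1.0):
        self.alpha = alpha
        self.beta = beta


class WithoutInit:
    """Hypothetical model class that defines no __init__ of its own."""


def constructor_param_names(cls):
    # Passing the class (not cls.__init__) yields the call signature of the
    # constructor, which never includes `self`.
    return list(inspect.signature(cls).parameters)


print(constructor_param_names(WithInit))  # ['alpha', 'beta']
print(constructor_param_names(WithoutInit))
```

Because `self` is never part of the returned names, no `remove("self")` step is needed, and a class without an explicit `__init__` does not need a special case.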

gemini-code-assist · 2026-06-02T22:05:27Z

+    parser.add_argument(
+        "options",
+        nargs=argparse.REMAINDER,
+        help="Additional options to pass to h5repack (e.g., '-f GZIP=9 -f SHUF')",
+    )
+    parser.add_argument("file", help="Path to the HDF5 file to repack")
+    args, options = parser.parse_known_args()


Defining a positional argument "options" with nargs=argparse.REMAINDER before "file" causes "options" to consume all remaining arguments (including the file path), leaving "file" unmatched and causing the script to always fail with a required argument error. Removing the "options" argument from the parser allows parse_known_args() to correctly capture all unrecognized options in the options list while successfully parsing "file".

    parser.add_argument("file", help="Path to the HDF5 file to repack")
    args, options = parser.parse_known_args()
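
A minimal sketch of the suggested parser shape follows. `build_parser` and the sample `argv` are assumptions for illustration. The file path is placed before the pass-through options here, since an option value that does not start with `-` (such as `GZIP=9`) is otherwise eligible to be matched as the `file` positional.

```python
import argparse


def build_parser():
    # Only the positional `file` is declared; anything the parser does not
    # recognize is returned separately by parse_known_args().
    parser = argparse.ArgumentParser(description="Sketch of a repack wrapper CLI")
    parser.add_argument("file", help="Path to the HDF5 file to repack")
    return parser


# Hypothetical command line: file first, then pass-through options.
argv = ["data.h5", "-f", "GZIP=9"]
args, options = build_parser().parse_known_args(argv)

print(args.file)  # data.h5
print(options)    # ['-f', 'GZIP=9']
```

The `options` list can then be forwarded unchanged to the external tool, which keeps the wrapper from having to mirror every flag the tool accepts.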

gemini-code-assist · 2026-06-02T22:05:27Z

+   "outputs": [],
+   "source": [
+    "if SAMPLER_NAME == \"flowMC\":\n",
+    "    global_acc_train = read_from_hdf5(inference_data_file, \"/acceptances/global/train\")\n",


The notebook assumes that all training/production datasets (like /acceptances/global/train, loss, /chains/train/...) are always present in the HDF5 file. If any phase was skipped or had no data, these datasets won't exist, causing read_from_hdf5 to raise a ValueError and crash the entire automated report generation. Consider checking if the datasets exist in the file before reading them.
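
One way to make the notebook tolerant of missing datasets is a small guard around each read. The sketch below is an assumption, not the repository's API: `read_optional` is a hypothetical helper, a plain `dict` stands in for the open HDF5 file, and the `/acceptances/global/production` path is invented for the example. An `h5py.File` supports the same `in` and `[]` operations, with `[()]` appended to load a dataset's values.

```python
def read_optional(store, dataset_path, default=None):
    """Return store[dataset_path] if the path exists, otherwise `default`."""
    if dataset_path in store:
        return store[dataset_path]
    return default


# A dict stands in for the HDF5 file; the training phase was skipped,
# so its dataset is absent. The production path is hypothetical.
inference_data = {"/acceptances/global/production": [0.41, 0.39]}

global_acc_train = read_optional(inference_data, "/acceptances/global/train")
global_acc_prod = read_optional(inference_data, "/acceptances/global/production")

if global_acc_train is None:
    print("skipping training acceptance plot: dataset missing")
if global_acc_prod is not None:
    print("production acceptances:", global_acc_prod)
```

Each plotting cell can then branch on `None` instead of letting a missing dataset abort the whole report.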

Qazalbash and others added 5 commits May 31, 2026 05:43

fix: correct quotation marks in HTML script tag for virtual-webgl.js

e4b0c36

feat: utils to repack and replace .h5 and .hdf5 files (#838)

50e8501

* feat: utils to repack and replace `.h5` and `.hdf5` files
* fix: remove bugs from option parsing and update epilog

feat: utilities to calculate marginal probabilities of mixture models (…

e99d0ab

…#833) Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

github-actions Bot assigned Qazalbash Jun 2, 2026

github-actions Bot requested a review from Qazalbash June 2, 2026 22:03

IamMuhammadZeeshan closed this Jun 2, 2026

gemini-code-assist Bot reviewed Jun 2, 2026


IamMuhammadZeeshan wants to merge 5 commits into main from dev
