Refactor: Use IANVS_EVAL_WORKSPACE for evaluating metric artifact paths#386

Open
ARYANPATEL-BIT wants to merge 2 commits into kubeedge:main from ARYANPATEL-BIT:fix/acc-workspace-paths
Conversation

@ARYANPATEL-BIT

Refactor: Use dynamic EVAL_WORKSPACE for metric artifact paths

🎯 Fixes

**Fixes #385**

🚨 The Problem

Current hardcoded paths in government_rag evaluation (acc.py):

# ❌ Ignores KubeEdge-Ianvs workspace config
with open("accuracy_results_model.json", "w") as f:    # model mode
    json.dump(results, f)
with open("accuracy_results_global.json", "w") as f:   # global mode
    json.dump(results, f)
with open("accuracy_results_local.json", "w") as f:    # local mode
    json.dump(results, f)
with open("accuracy_results_other.json", "w") as f:    # other mode
    json.dump(results, f)

Impact:

  • Bypasses benchmarkingjob.yaml workspace structure
  • Pollutes root directory with loose JSON files
  • No isolation between job iterations
  • Overwrites metrics across different runs
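The overwrite issue above is easy to reproduce: because every run targets the same hardcoded filename, a later run silently replaces an earlier one. A minimal sketch, using hypothetical metric values:

```python
import json

# Two independent benchmark runs, both writing to the same hardcoded path
run_1_results = {"accuracy": 0.91}  # hypothetical metrics from run 1
run_2_results = {"accuracy": 0.87}  # hypothetical metrics from run 2

for results in (run_1_results, run_2_results):
    with open("accuracy_results_model.json", "w") as f:
        json.dump(results, f)

# Only the last run's metrics survive
with open("accuracy_results_model.json") as f:
    print(json.load(f))  # {'accuracy': 0.87} -- run 1's metrics are lost
```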

✅ The Solution

Dynamic workspace resolution using IANVS_EVAL_WORKSPACE:

# ✅ Respects KubeEdge-Ianvs workspace context
import os
import json

def save_accuracy_results(results, mode="model"):
    output_dir = os.environ.get("IANVS_EVAL_WORKSPACE", ".")
    filename = f"accuracy_results_{mode}.json"
    output_path = os.path.join(output_dir, filename)
    
    os.makedirs(output_dir, exist_ok=True)
    
    with open(output_path, "w", encoding="utf-8") as f:
        json.dump(results, f, ensure_ascii=False, indent=4)
    
    print(f"✅ Saved: {output_path}")

📁 Before vs After

| Mode   | Before (Bug)                   | After (Fixed)                            |
|--------|--------------------------------|------------------------------------------|
| model  | ./accuracy_results_model.json  | ./workspace/accuracy_results_model.json  |
| global | ./accuracy_results_global.json | ./workspace/accuracy_results_global.json |
| local  | ./accuracy_results_local.json  | ./workspace/accuracy_results_local.json  |
| other  | ./accuracy_results_other.json  | ./workspace/accuracy_results_other.json  |

💾 Complete Refactored Function

# government_rag/.../acc.py - Fixed version
import os
import json

def save_metrics_by_mode(results, mode="model"):
    """
    Save accuracy results to Ianvs workspace with mode-specific filename.
    
    Args:
        results: Dict of evaluation metrics
        mode: Inference mode ("model", "global", "local", "other")
    """
    output_dir = os.environ.get("IANVS_EVAL_WORKSPACE", ".")
    filename = f"accuracy_results_{mode}.json"
    output_path = os.path.join(output_dir, filename)
    
    os.makedirs(output_dir, exist_ok=True)
    
    with open(output_path, "w", encoding="utf-8") as f:
        json.dump(results, f, ensure_ascii=False, indent=4)
    
    print(f"✅ Metrics saved for {mode} mode: {output_path}")
    return output_path

# Usage for all modes
save_metrics_by_mode(model_results, "model")
save_metrics_by_mode(global_results, "global")
save_metrics_by_mode(local_results, "local")

🧪 Verification

# Test with workspace
export IANVS_EVAL_WORKSPACE="./experiment_1/output"
python acc.py

# ✅ Creates clean structure:
# ./experiment_1/output/
# ├── accuracy_results_model.json
# ├── accuracy_results_global.json
# ├── accuracy_results_local.json
# └── accuracy_results_other.json
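The same behavior can be checked directly in Python. The sketch below restates the proposed function and exercises both code paths: with `IANVS_EVAL_WORKSPACE` set, files land inside the workspace; without it, the function falls back to the current directory (metric values are hypothetical):

```python
import json
import os
import tempfile

def save_metrics_by_mode(results, mode="model"):
    """Save results under IANVS_EVAL_WORKSPACE, falling back to the CWD."""
    output_dir = os.environ.get("IANVS_EVAL_WORKSPACE", ".")
    output_path = os.path.join(output_dir, f"accuracy_results_{mode}.json")
    os.makedirs(output_dir, exist_ok=True)
    with open(output_path, "w", encoding="utf-8") as f:
        json.dump(results, f, ensure_ascii=False, indent=4)
    return output_path

# With the env var set, the file lands inside the workspace...
with tempfile.TemporaryDirectory() as workspace:
    os.environ["IANVS_EVAL_WORKSPACE"] = workspace
    path = save_metrics_by_mode({"accuracy": 0.95}, "model")
    assert path.startswith(workspace)

# ...and without it, the function falls back to the current directory.
del os.environ["IANVS_EVAL_WORKSPACE"]
fallback_path = save_metrics_by_mode({"accuracy": 0.95}, "model")
assert os.path.dirname(fallback_path) == "."
```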

🎉 Benefits

| Benefit       | Description                                |
|---------------|--------------------------------------------|
| ✅ Clean      | Root directory stays pristine              |
| ✅ Isolated   | Each job iteration has its own workspace   |
| ✅ Compatible | Falls back to "." if the env var is unset  |
| ✅ Scalable   | Works with KubeEdge-Ianvs at scale         |

Status: Ready to merge - Proper workspace integration for production benchmarking! 🚀

Signed-off-by: Aryan Patel <aryan.patel7291@gmail.com>
@kubeedge-bot
Collaborator

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: ARYANPATEL-BIT
To complete the pull request process, please assign jaypume after the PR has been reviewed.
You can assign the PR to them by writing /assign @jaypume in a comment when ready.

The full list of commands accepted by this bot can be found here.

Details: Needs approval from an approver in each of these files.

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@kubeedge-bot kubeedge-bot added the size/S Denotes a PR that changes 10-29 lines, ignoring generated files. label Apr 11, 2026

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request updates the accuracy evaluation script to support a configurable workspace directory via the IANVS_EVAL_WORKSPACE environment variable across multiple functions. Feedback highlights that the implementation lacks directory creation logic, which will cause a FileNotFoundError if the specified workspace does not exist. Additionally, it is recommended to refactor the duplicated file-saving logic into a shared helper function to improve code maintainability.

Signed-off-by: Aryan Patel <aryan.patel7291@gmail.com>
@kubeedge-bot kubeedge-bot added size/M Denotes a PR that changes 30-99 lines, ignoring generated files. and removed size/S Denotes a PR that changes 10-29 lines, ignoring generated files. labels Apr 11, 2026
@ARYANPATEL-BIT
Author

/assign @jaypume


Labels

size/M Denotes a PR that changes 30-99 lines, ignoring generated files.

Development

Successfully merging this pull request may close these issues.

Bug/Refactor: Hardcoded Evaluation Paths in acc.py bypass the generic workspace

3 participants