Skip to content

[LFX Term 1 2026] Restoring LLM Edge Benchmark Suite Single Task Bench With Compression#409

Merged
kubeedge-bot merged 2 commits intokubeedge:mainfrom
NishantSinghhhhh:Restoration_single_task_bench_with_compression
May 7, 2026
Merged

[LFX Term 1 2026] Restoring LLM Edge Benchmark Suite Single Task Bench With Compression#409
kubeedge-bot merged 2 commits intokubeedge:mainfrom
NishantSinghhhhh:Restoration_single_task_bench_with_compression

Conversation

@NishantSinghhhhh
Copy link
Copy Markdown
Contributor

What type of PR is this?
/kind bug
/kind cleanup
/kind feature

What this PR does / why we need it:
This PR refactors and restores the single_task_bench_with_compression suite within the LLM Edge Benchmark to ensure it successfully executes end-to-end within the Ianvs framework.

Previously, the compression benchmarking pipeline failed due to unsupported paradigm types, missing pipeline contract methods in the base model script, and relative path resolution errors.

Key improvements and fixes include:

  • Paradigm & Configuration Alignment: Corrected the paradigm_type in algorithm.yaml from the unsupported singletasklearning_with_compression to the recognized singletasklearning. Fixed relative pathing bugs by ensuring model and dataset URLs explicitly use absolute paths across all YAML configurations (testenv.yaml, algorithm.yaml, and benchmarkingjob.yaml hyperparameter overrides).

  • Pipeline Contract Fulfillment: Updated basemodel.py to satisfy the strict Ianvs ML lifecycle checks by adding preprocess and postprocess pass-through methods with optional **kwargs.

…ompression

refactor: adjust algorithm YAML for paradigm type and enhance basemodel.py with logging and model execution improvements

Signed-off-by: NishantSinghhhhh <nishantsingh_230137@aitpune.edu.in>
@kubeedge-bot kubeedge-bot added kind/bug Categorizes issue or PR as related to a bug. kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. kind/feature Categorizes issue or PR as related to a new feature. labels Apr 23, 2026
@kubeedge-bot kubeedge-bot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label Apr 23, 2026
Signed-off-by: NishantSinghhhhh <nishantsingh_230137@aitpune.edu.in>
@NishantSinghhhhh
Copy link
Copy Markdown
Contributor Author

@MooreZheng sir, after making changes the example works perfectly fine

Screencast.from.2026-04-24.00-10-15.webm

Copy link
Copy Markdown

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces a compression benchmarking suite for LLMs within the Ianvs framework, featuring a comprehensive setup guide, updated dependencies, and a LlamaCppModel implementation that utilizes streaming to measure prefill latency. The review feedback suggests fixing markdown syntax and relative paths in the documentation, transitioning to module-level logging for better practice, and adding safety checks when processing model output chunks to prevent potential runtime errors.

Copy link
Copy Markdown
Member

@hsj576 hsj576 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm

@kubeedge-bot kubeedge-bot added the lgtm Indicates that a PR is ready to be merged. label May 7, 2026
@MooreZheng
Copy link
Copy Markdown
Collaborator

/lgtm

Copy link
Copy Markdown
Collaborator

@MooreZheng MooreZheng left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/approve

@kubeedge-bot
Copy link
Copy Markdown
Collaborator

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: hsj576, MooreZheng, NishantSinghhhhh

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@kubeedge-bot kubeedge-bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label May 7, 2026
@kubeedge-bot kubeedge-bot merged commit 5ff5046 into kubeedge:main May 7, 2026
12 of 13 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. kind/bug Categorizes issue or PR as related to a bug. kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. kind/feature Categorizes issue or PR as related to a new feature. lgtm Indicates that a PR is ready to be merged. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants