Implementation of [LFX] Domain-specific large model benchmarks: the edge perspective #196

Merged
kubeedge-bot merged 2 commits into kubeedge:main from IcyFeather233:main
Jan 31, 2026
Conversation

@IcyFeather233
Contributor

What type of PR is this?

/kind feature

What this PR does / why we need it:

This is the implementation of

CNCF - KubeEdge: Domain-specific large model benchmarks: the edge perspective (2025 Term 1)

This PR is related to #177

@kubeedge-bot kubeedge-bot added the kind/feature Categorizes issue or PR as related to a new feature. label May 6, 2025
@kubeedge-bot kubeedge-bot requested review from MooreZheng and hsj576 May 6, 2025 14:36
@kubeedge-bot kubeedge-bot added the size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. label May 6, 2025
@IcyFeather233 IcyFeather233 changed the title LFX: Implementation of [LFX] Domain-specific large model benchmarks: the edge perspective May 6, 2025
Collaborator

@MooreZheng MooreZheng left a comment


The PR looks fine to me so far. A few suggestions:

  1. An issue is needed, explaining the need to add a pre-process module to the single task learning scheme and its consequences for community members (i.e., it should be fine, since the pre-process module can be skipped).
  2. Take the 1-2-3-4 dimensions out of the data of each query and store the dimension information as metadata under specific directories, in order to keep the dataset simple.
  3. Switch to English once all content is fixed. There are also ways to maintain both English and Chinese versions of the README (see this link).
  4. Try to fix the CI issue.

Further reviews could be taken after demonstrations with experiments.
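Suggestion 2 above (moving dimension information into per-directory metadata) could be realized along these lines. This is a minimal sketch, not the dataset's actual layout: the file names `metadata.json` and `queries.jsonl` and all field names are hypothetical, for illustration only.

```python
import json
import tempfile
from pathlib import Path

# Hypothetical layout: one metadata.json per dimension directory instead of
# repeating the dimension fields inside every query record.
root = Path(tempfile.mkdtemp()) / "dimension_1"
root.mkdir(parents=True, exist_ok=True)
(root / "metadata.json").write_text(
    json.dumps({"dimension": 1, "description": "example dimension"}),
    encoding="utf-8",
)
(root / "queries.jsonl").write_text(
    json.dumps({"query": "What does the policy say about X?"}) + "\n",
    encoding="utf-8",
)

# Readers merge the directory-level metadata back into each query on load,
# so downstream code still sees the dimension fields on every record.
meta = json.loads((root / "metadata.json").read_text(encoding="utf-8"))
queries = [
    {**json.loads(line), **meta}
    for line in (root / "queries.jsonl").read_text(encoding="utf-8").splitlines()
]
print(queries[0]["dimension"])  # 1
```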

@IcyFeather233
Contributor Author

It seems that this CI / pylint (3.9) (pull_request) failure is not caused by my code change.

@AryanNanda17
Contributor

AryanNanda17 commented Jun 5, 2025

@IcyFeather233, you can try syncing your fork's main branch with the ianvs main branch. PR #213, which was recently merged, fixes the error.

@kubeedge-bot kubeedge-bot added needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. and removed needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. labels Jun 12, 2025
@IcyFeather233
Contributor Author

@MooreZheng @hsj576 Hi, please review this PR.

Collaborator

@MooreZheng MooreZheng left a comment


Overall it looks good now. Some minor comments from the routine meeting:

  1. Update the Kaggle dataset for the embedding.
  2. Take a look at the preprocess issue and add advice about the Sedna version in this PR if needed. If it concerns other documents, such as the quick start, the advice can be added in another PR.
  3. The PR currently has 9 commits; please squash them into one.

@kubeedge-bot kubeedge-bot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jun 20, 2025
@kubeedge-bot kubeedge-bot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jul 16, 2025
Signed-off-by: IcyFeather <mengzhuo.happy@gmail.com>
@kubeedge-bot kubeedge-bot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jul 22, 2025
@kubeedge-bot kubeedge-bot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jul 22, 2025
@IcyFeather233
Contributor Author

IcyFeather233 commented Jul 22, 2025

  1. I have updated the Kaggle dataset here, and the related information is also added to the PR README file.
  2. The preprocess step will not affect other examples. To use it, just add a preprocess function to the BaseModel class like this:
@ClassFactory.register(ClassType.GENERAL, alias="gen")
class BaseModel:

    def __init__(self, **kwargs):
        ...

    def preprocess(self, **kwargs):
        # run your preprocessing before training
        self.rag = GovernmentRAG(model_name="/path/to/models/bge-m3", device="cuda", persist_directory="./chroma_db")
        LOGGER.info("RAG initialized")

    def train(self, train_data, valid_data=None, **kwargs):
        ...
        
    def save(self, model_path):
        ...

    def predict(self, data, input_shape=None, **kwargs):
        ...

    def load(self, model_url=None):
        ...

    def evaluate(self, data, model_path, **kwargs):
        ...

If it is not needed, it is fine not to add this function, because the caller is written like this:

def _preprocess(self, job):
    # call the optional hook once and pass its result through
    result = job.preprocess()
    if result is None:
        return None
    return result

So it will not affect previous examples.
  3. All commits have been squashed into one.
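The optional-hook behaviour described above can be sketched outside ianvs as follows. This is a self-contained illustration, not ianvs code: the names `Paradigm`, `PlainModel`, and `RagModel` are hypothetical, and the dynamic `getattr` lookup stands in for however the framework actually dispatches the hook.

```python
class Paradigm:
    """Minimal sketch of a paradigm that treats preprocess as an optional hook."""

    def _preprocess(self, job):
        # Look up the hook dynamically so models defined without it keep working.
        hook = getattr(job, "preprocess", None)
        if hook is None:
            return None
        return hook()

    def run(self, job):
        steps = []
        # Only record the preprocess step when the hook exists and did something.
        if self._preprocess(job) is not None:
            steps.append("preprocess")
        steps.append("train")
        return steps


class PlainModel:
    """A model from an earlier example: defines no preprocess hook."""


class RagModel:
    """A model that initializes RAG before training."""

    def preprocess(self, **kwargs):
        return "rag-ready"


print(Paradigm().run(PlainModel()))  # ['train']
print(Paradigm().run(RagModel()))    # ['preprocess', 'train']
```

Because the lookup falls back to `None`, earlier examples that never defined `preprocess` run exactly as before.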

@IcyFeather233 IcyFeather233 requested a review from MooreZheng July 22, 2025 07:15
Collaborator

@MooreZheng MooreZheng left a comment


/lgtm

@kubeedge-bot kubeedge-bot added the lgtm Indicates that a PR is ready to be merged. label Jul 30, 2025
@MooreZheng
Collaborator

This is from the LFX mentorship Term 2 and the PR looks good to me now. We need another lgtm from @hsj576 :D

@MooreZheng MooreZheng linked an issue Jan 29, 2026 that may be closed by this pull request
Member

@hsj576 hsj576 left a comment


/lgtm

Collaborator

@MooreZheng MooreZheng left a comment


/approve

@kubeedge-bot
Collaborator

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: hsj576, IcyFeather233, MooreZheng

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details: Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@kubeedge-bot kubeedge-bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jan 31, 2026
@kubeedge-bot kubeedge-bot merged commit 1d3da5b into kubeedge:main Jan 31, 2026
12 of 13 checks passed
@MooreZheng
Collaborator

Related issues and PRs:
Phase 1 Gov LLM: #95 #113 #144
Phase 2 Gov RAG: #177 #186 #196


Labels

approved: Indicates a PR has been approved by an approver from all required OWNERS files.
kind/feature: Categorizes issue or PR as related to a new feature.
lgtm: Indicates that a PR is ready to be merged.
size/XL: Denotes a PR that changes 500-999 lines, ignoring generated files.

Development

Successfully merging this pull request may close these issues.

Add Preprocess Module to SingleTaskLearning Paradigm

5 participants