[LFX] Enhanced cloud-edge-collaborative-inference-for-llm example by AryanNanda17 · Pull Request #188 · kubeedge/ianvs

AryanNanda17 · 2025-04-12T20:25:29Z

The example "Cloud-Edge Collaborative Inference for LLM" is well-structured. This PR improves few areas of the example to make it a fully functional quick-start guide with minimal errors.

The changes done does the following:-

Included a Resource-Sensitive Router
Added api_provider, api_base_url and api_key_env parameters to cloudmodel in test_queryrouting.yaml file.
Ease of Setting up the environment
Backward Compatibility
Updating the Threshold for Random Routing
Error handling in this example is improved
Correcting device = “cuda” assumption

Note:- This PR is an implementation of point 1 of #185. This PR comes under LFX Spring Term 2025 project "Enhancing Dependency Management and Documentation of ianvs".

Fixes #178

AryanNanda17 · 2025-04-14T20:10:56Z

/kind enhancement

kubeedge-bot · 2025-04-14T20:11:00Z

@AryanNanda17: The label(s) kind/enhancement cannot be applied, because the repository doesn't have them

Details

In response to this:

/kind enhancement

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

AryanNanda17 · 2025-04-18T05:10:31Z

@FuryMartin,

Since groq doesn't support the usage options, I have done an approximation of prompt_tokens and completion_tokens :

                prompt_tokens = len(messages[0]['content'].split())  # Approximate
                completion_tokens = len(text.split())  # Approximate

For more accuracy we can use a tokenizer compatible with the model to estimate token counts accurately. A common approach is to use the tiktoken library for models compatible with OpenAI’s tokenization.

That is:-

self.tokenizer = tiktoken.get_encoding("cl100k_base")
prompt_text = "".join([msg["content"] for msg in messages if msg["content"]])
prompt_tokens = len(self.tokenizer.encode(prompt_text))
completion_tokens = len(self.tokenizer.encode(text))

What would you recommend to go with?

MooreZheng

Good to see that all suggestions from other reviewers are properly tackled.

Just some tiny logistics consideration:

It aims to build the cloud-edge-collaborative-inference-for-llm example. The purpose can be specified in the title and description to make it clear to community members. We also need to squash the commits into one.
The CI test for this PR is not yet past, and might not be wrong for this PR. But we should have the CI pass before merging this PR.
It would be appreciated to update the dataset link, as specified below.

AryanNanda17 · 2025-04-24T09:43:02Z

To do:-

Update the dataset link

AryanNanda17 · 2025-04-30T06:30:33Z

@FuryMartin , could you please share with me the updated dataset link?

AryanNanda17 · 2025-05-31T10:11:53Z

The CI workflow failure is described in #212 and has been corrected in #213.

FuryMartin

Nice work

kubeedge-bot · 2025-06-04T02:56:46Z

@FuryMartin: changing LGTM is restricted to collaborators

Details

In response to this:

Nice work

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

dataset instructions changed from hugging face to kaggle matplotlib added to requirements.txt dataset instructions changed from hugging face to kaggle dataset instructions changed from hugging face to kaggle dataset instructions changed from hugging face to kaggle Signed-off-by: Aryan Nanda <nandaaryan823@gmail.com> changes in readme of cloud-edge-collaborative-inference done to use kaggle instead of huggingface Signed-off-by: Aryan <nandaaryan823@gmail.com> readme file updated Signed-off-by: Aryan <nandaaryan823@gmail.com> print changed to logger Signed-off-by: Aryan <nandaaryan823@gmail.com>

AryanNanda17 · 2025-06-05T09:19:07Z

@MooreZheng @FuryMartin @hsj576
I have resolved all the comments. This PR is good to go as well.
Thanks

MooreZheng

/lgtm

hsj576

/lgtm

MooreZheng

/approve

kubeedge-bot · 2025-06-19T14:11:24Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: FuryMartin, hsj576, MooreZheng

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details

Needs approval from an approver in each of these files:

~~OWNERS~~ [MooreZheng]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

kubeedge-bot requested review from Poorunga and hsj576 April 12, 2025 20:25

kubeedge-bot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label Apr 12, 2025

AryanNanda17 changed the title ~~Lfx proposal#185 point1 implementation~~ LFX proposal#185 Point1 Implementation Apr 12, 2025

kubeedge-bot added size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. and removed size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Apr 13, 2025

AryanNanda17 force-pushed the lfx_proposal#185_point1 branch from ab03606 to c0767d8 Compare April 14, 2025 18:35

FuryMartin reviewed Apr 17, 2025

View reviewed changes

AryanNanda17 force-pushed the lfx_proposal#185_point1 branch from c0767d8 to fe22b2e Compare April 18, 2025 04:15

AryanNanda17 requested a review from FuryMartin April 18, 2025 09:45

AryanNanda17 mentioned this pull request Apr 21, 2025

Track of Runnable Examples #194

Open

25 tasks

MooreZheng requested changes Apr 24, 2025

View reviewed changes

Comment thread examples/cloud-edge-collaborative-inference-for-llm/Dockerfile Outdated

kubeedge-bot assigned MooreZheng Apr 24, 2025

AryanNanda17 changed the title ~~LFX proposal#185 Point1 Implementation~~ Improves cloud-edge-collaborative-inference-for-llm example to make it a fully functional quick-start guide with minimal errors. Apr 24, 2025

AryanNanda17 force-pushed the lfx_proposal#185_point1 branch from ef20a47 to f4e8c9f Compare April 24, 2025 09:41

AryanNanda17 requested a review from MooreZheng April 30, 2025 07:10

AryanNanda17 force-pushed the lfx_proposal#185_point1 branch from 90d61cb to 284c2f6 Compare April 30, 2025 07:50

AryanNanda17 changed the title ~~[LFX] Enhancement of cloud-edge-collaborative-inference-for-llm example to make it a fully functional quick-start guide with minimal errors~~ [LFX] Enhancement of cloud-edge-collaborative-inference-for-llm example May 13, 2025

AryanNanda17 changed the title ~~[LFX] Enhancement of cloud-edge-collaborative-inference-for-llm example~~ [LFX] Enhanced cloud-edge-collaborative-inference-for-llm example May 13, 2025

This was referenced May 21, 2025

fixes onnx package issue #207

Closed

安装完ianvs后使用ianvs -v命令报错ModuleNotFoundError: No module named 'onnx' #205

Closed

AryanNanda17 force-pushed the lfx_proposal#185_point1 branch 2 times, most recently from 089f6f1 to 4f7d591 Compare May 31, 2025 09:33

AryanNanda17 force-pushed the lfx_proposal#185_point1 branch 2 times, most recently from ba8de50 to 51cca08 Compare May 31, 2025 09:56

FuryMartin approved these changes Jun 4, 2025

View reviewed changes

MooreZheng reviewed Jun 5, 2025

View reviewed changes

Comment thread ...-edge-collaborative-inference-for-llm/testalgorithms/query-routing/models/huggingface_llm.py Outdated

AryanNanda17 force-pushed the lfx_proposal#185_point1 branch from 51cca08 to 2a1d15f Compare June 5, 2025 08:52

AryanNanda17 force-pushed the lfx_proposal#185_point1 branch from 2a1d15f to 46340c8 Compare June 5, 2025 09:13

kubeedge-bot added the lgtm Indicates that a PR is ready to be merged. label Jun 5, 2025

MooreZheng reviewed Jun 5, 2025

View reviewed changes

hsj576 approved these changes Jun 12, 2025

View reviewed changes

kubeedge-bot assigned hsj576 Jun 12, 2025

MooreZheng approved these changes Jun 19, 2025

View reviewed changes

kubeedge-bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jun 19, 2025

kubeedge-bot merged commit 3b13981 into kubeedge:main Jun 19, 2025
13 checks passed

AryanNanda17 mentioned this pull request Aug 15, 2025

REQUEST: New membership for Aryan Nanda kubeedge/community#219

Closed

7 tasks

Conversation

AryanNanda17 commented Apr 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AryanNanda17 commented Apr 14, 2025

Uh oh!

kubeedge-bot commented Apr 14, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

AryanNanda17 commented Apr 18, 2025

Uh oh!

MooreZheng left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

AryanNanda17 commented Apr 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AryanNanda17 commented Apr 30, 2025

Uh oh!

AryanNanda17 commented May 31, 2025

Uh oh!

FuryMartin left a comment

Choose a reason for hiding this comment

Uh oh!

kubeedge-bot commented Jun 4, 2025

Uh oh!

Uh oh!

AryanNanda17 commented Jun 5, 2025

Uh oh!

MooreZheng left a comment

Choose a reason for hiding this comment

Uh oh!

hsj576 left a comment

Choose a reason for hiding this comment

Uh oh!

MooreZheng left a comment

Choose a reason for hiding this comment

Uh oh!

kubeedge-bot commented Jun 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

AryanNanda17 commented Apr 12, 2025 •

edited

Loading

MooreZheng left a comment •

edited

Loading

AryanNanda17 commented Apr 24, 2025 •

edited

Loading