我用lora训练完之后需要选择一个表现较好的checkpoint,只能逐个export导出模型来测试吗? #4838
Unanswered
fengshansi
asked this question in
Q&A
Replies: 3 comments 7 replies
-
直接用 |
Beta Was this translation helpful? Give feedback.
4 replies
-
一个在检查点文件里.safetensors后缀的文件,
…---原始邮件---
发件人: ***@***.***>
发送时间: 2025年2月18日(周二) 下午2:23
收件人: ***@***.***>;
抄送: ***@***.******@***.***>;
主题: Re: [hiyouga/LLaMA-Factory] 我用lora训练完之后需要选择一个表现较好的checkpoint,只能逐个export导出模型来测试吗? (Discussion #4838)
大家好,我想问问,这是我的saves/Llama-2-7B-Chat/lora/train_2025-02-18-11-47-28是adapter_name_or_path,而adapter_name_or_path下面这么多个文件和文件夹,export模型时,加载的checkpoint到底是哪一个呢?代码中哪一块有体现呢?
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you commented.Message ID: ***@***.***>
|
Beta Was this translation helpful? Give feedback.
3 replies
-
请问如果是vl模型可以这样实现吗?我在用qwen2vl,能否不进行merge然后直接加载adapter和base model进行推理? |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
主要是导出模型占用磁盘空间过大,如何运行的时候加载lora的checkpoint呢?我使用的是baichuan2和qwen2。
Beta Was this translation helpful? Give feedback.
All reactions