We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
基于lora的方式finetune模型后,推理时间很长,平均时间要40s,而相同的case直接用chatglm-6b推理平均时间要16s