2 files changed: +56 -6 lines changed
@@ -336,6 +336,30 @@ ASCEND_RT_VISIBLE_DEVICES=0 swift deploy \
 ```

 ## Current Support Status
+| Primary Feature          | Feature              | Status            |
+| ------------------------ | -------------------- | ----------------- |
+| Training Paradigm        | CPT                  | Supported         |
+|                          | SFT                  | Supported         |
+|                          | DPO                  | Supported         |
+|                          | RM                   | Supported         |
+| Distributed              | DDP                  | Supported         |
+|                          | FSDP                 | Supported         |
+|                          | FSDP2                | Supported         |
+|                          | DeepSpeed            | Supported         |
+|                          | MindSpeed(Megatron)  | Supported         |
+| PEFT                     | FULL                 | Supported         |
+|                          | LoRA                 | Supported         |
+|                          | QLoRA                | Not yet supported |
+| RLHF                     | GRPO                 | Supported         |
+|                          | PPO                  | Supported         |
+| Performance Optimization | Fused ops such as FA | Supported         |
+|                          | Liger-Kernel         | Not yet supported |
+| Deployment               | PT                   | Supported         |
+|                          | vLLM                 | Supported         |
+|                          | SGLang               | Not yet supported |
+
+------
+

 ### Table 1: SFT Algorithms

@@ -371,11 +395,12 @@ ASCEND_RT_VISIBLE_DEVICES=0 swift deploy \

 ### Table 3: Modules Currently Unsupported / Not Fully Verified on NPU

-| item                                |
-| ----------------------------------- |
-| Liger-kernel                        |
-| Quantization/QLoRA-related          |
-| Using sglang as the inference engine |
+| item                                              |
+| ------------------------------------------------- |
+| Liger-kernel                                      |
+| Quantization/QLoRA-related                        |
+| Using sglang as the inference engine              |
+| Enabling ETP for LoRA training when using Megatron |


 ## NPU WeChat Group
@@ -318,6 +318,30 @@ ASCEND_RT_VISIBLE_DEVICES=0 swift deploy \

 ## Current Support Status

+| Primary Feature          | Feature              | Status        |
+| ------------------------ | -------------------- | ------------- |
+| Training Paradigm        | CPT                  | Supported     |
+|                          | SFT                  | Supported     |
+|                          | DPO                  | Supported     |
+|                          | RM                   | Supported     |
+| Distributed              | DDP                  | Supported     |
+|                          | FSDP                 | Supported     |
+|                          | FSDP2                | Supported     |
+|                          | DeepSpeed            | Supported     |
+|                          | MindSpeed (Megatron) | Supported     |
+| PEFT                     | FULL                 | Supported     |
+|                          | LoRA                 | Supported     |
+|                          | QLoRA                | Not Supported |
+| RLHF                     | GRPO                 | Supported     |
+|                          | PPO                  | Supported     |
+| Performance Optimization | Fused ops such as FA | Supported     |
+|                          | Liger-Kernel         | Not Supported |
+| Deployment               | PT                   | Supported     |
+|                          | vLLM                 | Supported     |
+|                          | SGLang               | Not Supported |
+
+---
+
 ### Table 1: SFT Algorithms

 | Algorithm | Model Families | Strategy | Hardware |
@@ -356,7 +380,8 @@ ASCEND_RT_VISIBLE_DEVICES=0 swift deploy \
 | ------------------------ |
 | Liger-kernel |
 | Quantization/QLoRA |
-| Using sglang as inference engine |
+| Using SGLang as inference engine |
+| Enabling ETP for LoRA training when using Megatron |


 ## NPU WeChat Group