Skip to content

Commit 18547ad

Browse files
addsubmuldivJintao-Huang
authored andcommitted
Update the NPU-supported features table (#7562)
1 parent e10aff5 commit 18547ad

File tree

2 files changed

+56
-6
lines changed

2 files changed

+56
-6
lines changed

docs/source/BestPractices/NPU-support.md

Lines changed: 30 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -336,6 +336,30 @@ ASCEND_RT_VISIBLE_DEVICES=0 swift deploy \
336336
```
337337

338338
## 支持现状
339+
| 一级特性 | 特性 | 进展 |
340+
| -------- | ------------------- | -------- |
341+
| 训练范式 | CPT | 已支持 |
342+
| | SFT | 已支持 |
343+
| | DPO | 已支持 |
344+
| | RM | 已支持 |
345+
| 分布式 | DDP | 已支持 |
346+
| | FSDP | 已支持 |
347+
| | FSDP2 | 已支持 |
348+
| | DeepSpeed | 已支持 |
349+
| | MindSpeed(Megatron) | 已支持 |
350+
| 低参微调 | FULL | 已支持 |
351+
| | LoRA | 已支持 |
352+
| | QLoRA | 暂不支持 |
353+
| RLHF | GRPO | 已支持 |
354+
| | PPO | 已支持 |
355+
| 性能优化 | FA 等融合算子 | 已支持 |
356+
| | Liger-Kernel | 暂不支持 |
357+
| 部署 | PT | 已支持 |
358+
| | vLLM | 已支持 |
359+
| | SGLang | 暂不支持 |
360+
361+
------
362+
339363

340364
### 表 1:SFT 类算法
341365

@@ -371,11 +395,12 @@ ASCEND_RT_VISIBLE_DEVICES=0 swift deploy \
371395

372396
### 表 3:当前 NPU 暂不支持 / 未完全验证的模块
373397

374-
| item |
375-
| ---------------------- |
376-
| Liger-kernel |
377-
| 量化/QLoRA相关 |
378-
| 使用sglang作为推理引擎 |
398+
| item |
399+
| --------------------------------- |
400+
| Liger-kernel |
401+
| 量化/QLoRA相关 |
402+
| 使用sglang作为推理引擎 |
403+
| 使用megatron时开启ETP进行lora训练 |
379404

380405

381406
## NPU微信群

docs/source_en/BestPractices/NPU-support.md

Lines changed: 26 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -318,6 +318,30 @@ ASCEND_RT_VISIBLE_DEVICES=0 swift deploy \
318318

319319
## Current Support Status
320320

321+
| Primary Feature | Feature | Status |
322+
| --------------- | ---------------------- | ------------- |
323+
| Training Paradigm | CPT | Supported |
324+
| | SFT | Supported |
325+
| | DPO | Supported |
326+
| | RM | Supported |
327+
| Distributed | DDP | Supported |
328+
| | FSDP | Supported |
329+
| | FSDP2 | Supported |
330+
| | DeepSpeed | Supported |
331+
| | MindSpeed (Megatron) | Supported |
332+
| PEFT | FULL | Supported |
333+
| | LoRA | Supported |
334+
| | QLoRA | Not Supported |
335+
| RLHF | GRPO | Supported |
336+
| | PPO | Supported |
337+
| Performance Optimization | Fused ops such as FA | Supported |
338+
| | Liger-Kernel | Not Supported |
339+
| Deployment | PT | Supported |
340+
| | vLLM | Supported |
341+
| | SGLang | Not Supported |
342+
343+
---
344+
321345
### Table 1: SFT Algorithms
322346

323347
| Algorithm | Model Families | Strategy | Hardware |
@@ -356,7 +380,8 @@ ASCEND_RT_VISIBLE_DEVICES=0 swift deploy \
356380
| ------------------------ |
357381
| Liger-kernel |
358382
| Quantization/QLoRA |
359-
| Using sglang as inference engine |
383+
| Using SGLang as inference engine |
384+
| Enable ETP for LoRA training when using Megatron |
360385

361386

362387
## NPU WeChat Group

0 commit comments

Comments
 (0)