Commit 3bf9c85
[Fix] Merge previous contributions from fw/refactor to lite (areal-project#163)
* initial proposal
* add arealite
* .
* change api
* .
* remove LOG_ROOT
* remove MODEL_SAVE_PATH
* remove PARAM_REALLOC_PATH, DATASET_CACHE
* prepare for testing
* prepare for testing
* ready for run
* local run
* tests mainly pass
* format
* .
* amend cluster.py
* .
* .
* client test pass
* pass rollout test
* remove unused imports
* add arealite readme
* change api
* .
* .
* .
* .
* .
* .
* .
* .
* format
* .
* implement iteraptable generation (areal-project#112)
Co-authored-by: zhaochenyang <zhaochenyang20@gmail.com>
* .
* fix
* .
* .
* .
* pass controller generate batch test
* .
* refactor rollout controller into worker and controller
* .
* .
* .
* change to async rollout
* pass rollout controller test
* pass test
* .
* update readme
* .
* sft debug
* .
* add lisence
* remove unused files
* remove unsed args in ppo
* add hf engine wrapper (areal-project#116)
* add hf engine
* fix issues
* fix ppo bugs and add test
* add hf client interface and modify cli args
* fix bugs
* fix issues
* Merge fw/refactor
* Finish hf wrapper test
* add test
---------
Co-authored-by: Wei Fu <36355462+garrett4wade@users.noreply.github.com>
* format
* format
* .
* refine hf engine
* .
* fix
* add fsdp engine and sft tests
* .
* .
* .
* pass ppo unittest
* pass ppo and rollout controller tests
* clear unused imports
* rename ppo to grpo
* change reward function organization
* reorganize code
* add dataset api
* .
* .
* .
* format
* chmod fix
* .
* rename workflow to collector
* refactor llm_client location
* .
* .
* fix llm server api
* refactor config structure
* .
* fix tests
* .
* .
* .
* Fix unresolved issue in SFTTrainer PR (areal-project#139)
* .
* .
* efficient loading
* format
* .
* .
* .
* .
* .
* .
* Add CI for testing AReaLite (areal-project#150)
* ci: add test-arealite
* ci: add checkout before running test-arealite
* ci: add USERNAME
* ci: add test script
* ci: add GitHub mirror
* ci: fix typo
* ci: clone one commit
* ci: fix condition
* ci: set command timeout to 60m
* ci: enable pip cache
* ci: optimize container lifecycle
* ci: split into many stages
* ci(test-arealite): fix typo
* ci: fix wrong env
* ci: fix pytest
* ci: uninstall transformer-engine
* ci: uninstall transformer-engine
* ci: fix model paths
* ci: show stdout/stderr
* ci: fix not clean up
* ci: backup sglang
* ci: remove tmp repo dir when run
* ci: fix docker run exit 1 condition
* ci(test-arealite): limit the concurrency and extend command timeout
* .
* merge fw/refactor
* revert some changes
* fix
---------
Co-authored-by: meizhiyu.mzy <meizhiyu.mzy@antgroup.com>
Co-authored-by: Chayenne <zhaochen20@outlook.com>
Co-authored-by: zhaochenyang <zhaochenyang20@gmail.com>
Co-authored-by: Jayon02 <qiujiangc@outlook.com>
Co-authored-by: root <meizhiyu.mzy>
Co-authored-by: Zijian Zhang <futrime@outlook.com>1 parent d48bf00 commit 3bf9c85
5 files changed
Lines changed: 9 additions & 4 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
737 | 737 | | |
738 | 738 | | |
739 | 739 | | |
740 | | - | |
| 740 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
4 | 4 | | |
5 | 5 | | |
6 | 6 | | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
7 | 10 | | |
8 | 11 | | |
9 | 12 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
53 | 53 | | |
54 | 54 | | |
55 | 55 | | |
| 56 | + | |
56 | 57 | | |
57 | 58 | | |
58 | | - | |
59 | 59 | | |
60 | 60 | | |
61 | 61 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
8 | 8 | | |
9 | 9 | | |
10 | 10 | | |
| 11 | + | |
11 | 12 | | |
12 | 13 | | |
13 | 14 | | |
| |||
47 | 48 | | |
48 | 49 | | |
49 | 50 | | |
| 51 | + | |
50 | 52 | | |
51 | 53 | | |
52 | 54 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
69 | 69 | | |
70 | 70 | | |
71 | 71 | | |
| 72 | + | |
72 | 73 | | |
73 | 74 | | |
74 | 75 | | |
75 | | - | |
76 | | - | |
| 76 | + | |
0 commit comments