Releases: hpcaitech/ColossalAI

Version v0.3.0 Release Today!

25 May 08:26
d42b1be

What's Changed

Release

Nfc

  • [nfc] fix typo colossalai/ applications/ (#3831) by digger yu
  • [NFC] fix typo colossalai/auto_parallel nn utils etc. (#3779) by digger yu
  • [NFC] fix typo colossalai/amp auto_parallel autochunk (#3756) by digger yu
  • [NFC] fix typo with colossalai/auto_parallel/tensor_shard (#3742) by digger yu
  • [NFC] fix typo applications/ and colossalai/ (#3735) by digger-yu
  • [NFC] polish colossalai/engine/gradient_handler/__init__.py code style (#3329) by Ofey Chan
  • [NFC] polish colossalai/context/random/__init__.py code style (#3327) by yuxuan-lou
  • [NFC] polish colossalai/fx/tracer/_tracer_utils.py (#3323) by Michelle
  • [NFC] polish colossalai/gemini/paramhooks/_param_hookmgr.py code style by Xu Kai
  • [NFC] polish initializer_data.py code style (#3287) by RichardoLuo
  • [NFC] polish colossalai/cli/benchmark/models.py code style (#3290) by Ziheng Qin
  • [NFC] polish initializer_3d.py code style (#3279) by Kai Wang (Victor Kai)
  • [NFC] polish colossalai/engine/gradient_accumulation/_gradient_accumulation.py code style (#3277) by Sze-qq
  • [NFC] polish colossalai/context/parallel_context.py code style (#3276) by Arsmart1
  • [NFC] polish colossalai/engine/schedule/_pipeline_schedule_v2.py code style (#3275) by Zirui Zhu
  • [NFC] polish colossalai/nn/_ops/addmm.py code style (#3274) by Tong Li
  • [NFC] polish colossalai/amp/__init__.py code style (#3272) by lucasliunju
  • [NFC] polish code style (#3273) by Xuanlei Zhao
  • [NFC] polish colossalai/fx/proxy.py code style (#3269) by CZYCW
  • [NFC] polish code style (#3268) by Yuanchen
  • [NFC] polish tensor_placement_policy.py code style (#3265) by Camille Zhong
  • [NFC] polish colossalai/fx/passes/split_module.py code style (#3263) by CsRic
  • [NFC] polish colossalai/global_variables.py code style (#3259) by jiangmingyan
  • [NFC] polish colossalai/engine/gradient_handler/_moe_gradient_handler.py (#3260) by LuGY
  • [NFC] polish colossalai/fx/profiler/experimental/profiler_module/embedding.py code style (#3256) by dayellow

Doc

Workflow

Booster

Docs

  • [docs] change placememt_policy to placement_policy (#3829) by digger yu

Evaluation

  • [evaluation] add automatic evaluation pipeline (#3821) by Yuanchen

Docker

Api

  • [API] add docstrings and initialization to apex amp, naive amp (#3783) by jiangmingyan

Test


Version v0.2.8 Release Today!

29 Mar 02:26
a0b3749

What's Changed

Release

Format

Doc

Application

Chat

Coati

Colossalchat

Examples

Fx

Booster

Ci

Api

Hotfix

Chatgpt

Lazyinit

  • [lazyinit] combine lazy tensor with dtensor (#3204) by ver217
  • [lazyinit] add correctness verification (#3147) by ver217
  • [lazyinit] refactor lazy tensor and lazy init ctx (#3131) by ver217

Auto

Analyzer

Dreambooth

  • [dreambooth] fixing the incompatibility in requirements.txt (#3190) by NatalieC323

Auto-parallel

  • [auto-parallel] add auto-offload feature (#3154) by Zihao

Zero

  • [zero] Refactor ZeroContextConfig class using dataclass (#3186) by YH

Test

Refactor

Tests

  • [tests] model zoo add torchaudio models (#3138) by ver217
  • [tests] diffuser models in model zoo (#3136) by HELSON

Docker

Dtensor

Workflow

  • [workflow] purged extension cache before GPT test (#3128) by Frank Lee

Autochunk

Tutorial

Nvidia

Full Changelog: v0.2.8...v0.2.7

Version v0.2.7 Release Today!

10 Mar 06:56
26db1cb

What's Changed

Release

Chatgpt

Kernel

  • [kernel] added kernel loader to softmax autograd function (#3093) by Frank Lee
  • [kernel] cached the op kernel and fixed version check (#2886) by Frank Lee

Analyzer

  • [analyzer] a minimal implementation of static graph analyzer (#2852) by Super Daniel

Diffusers

Doc

Autochunk

Dtensor

Workflow

  • [workflow] fixed doc build trigger condition (#3072) by Frank Lee
  • [workflow] supported conda package installation in doc test (#3028) by Frank Lee
  • [workflow] fixed the post-commit failure when no formatting needed (#3020) by Frank Lee
  • [workflow] added auto doc test on PR (#2929) by Frank Lee
  • [workflow] moved pre-commit to post-commit (#2895) by Frank Lee

Booster

Example

Hotfix

Revert] recover "[refactor

Format

Pipeline

Fx

Refactor

Misc

Autoparallel

  • [autoparallel] apply repeat block to reduce solving time (#2912) by YuliangLiu0306
  • [autoparallel] find repeat blocks (#2854) by YuliangLiu0306
  • [autoparallel] Patch meta information for nodes that will not be handled by SPMD solver (#2823) by Boyuan Yao
  • [autoparallel] Patch meta information of torch.where (#2822) by Boyuan Yao
  • [autoparallel] Patch meta information of torch.tanh() and torch.nn.Dropout (#2773) by Boyuan Yao
  • [autoparallel] Patch tensor related operations meta information (#2789) by Boyuan Yao
  • [autoparallel] rotor solver refactor (#2813) by Boyuan Yao
  • [autoparallel] Patch meta information of torch.nn.Embedding (#2760) by [Boyuan Ya...

Version v0.2.6 Release Today!

10 Mar 06:57
89aa792

What's Changed

Release

Doc

Workflow

  • [workflow] fixed doc build trigger condition (#3072) by Frank Lee
  • [workflow] supported conda package installation in doc test (#3028) by Frank Lee
  • [workflow] fixed the post-commit failure when no formatting needed (#3020) by Frank Lee
  • [workflow] added auto doc test on PR (#2929) by Frank Lee
  • [workflow] moved pre-commit to post-commit (#2895) by Frank Lee

Booster

Example

Autochunk

Chatgpt

Dtensor

Hotfix

Revert] recover "[refactor

Format

Pipeline

Fx

Refactor

Kernel

  • [kernel] cached the op kernel and fixed version check (#2886) by Frank Lee

Misc

Autoparallel

  • [autoparallel] apply repeat block to reduce solving time (#2912) by YuliangLiu0306
  • [autoparallel] find repeat blocks (#2854) by YuliangLiu0306
  • [autoparallel] Patch meta information for nodes that will not be handled by SPMD solver (#2823) by Boyuan Yao
  • [autoparallel] Patch meta information of torch.where (#2822) by Boyuan Yao
  • [autoparallel] Patch meta information of torch.tanh() and torch.nn.Dropout (#2773) by Boyuan Yao
  • [autoparallel] Patch tensor related operations meta information (#2789) by Boyuan Yao
  • [autoparallel] rotor solver refactor (#2813) by Boyuan Yao
  • [autoparallel] Patch meta information of torch.nn.Embedding (#2760) by Boyuan Yao
  • [autoparallel] distinguish different parallel strategies (#2699) by YuliangLiu0306

Zero

Cli

Triton

  • [triton] added copyright information for flash attention (#2835) by Frank Lee

Nfc

  • [NFC] polish colossalai/engine/schedule/_pipeline_schedule.py code style (#2744) by Michelle
  • [NFC] polish code format by binmakeswell
  • [NFC] polish colossala...

Version v0.2.5 Release Today!

15 Feb 08:53
c5be83a

What's Changed

Chatgpt

  • [chatgpt] optimize generation kwargs (#2717) by ver217

Autoparallel

Doc

Devops

Workflow

App

Full Changelog: v0.2.5...v0.2.4

Version v0.2.4 Release Today!

14 Feb 12:07
c3abdd0

What's Changed

Release

Doc

Zero1&2

  • [zero1&2] only append parameters with gradients (#2681) by HELSON

Gemini

  • [gemini] fix colo_init_context (#2683) by ver217
  • [gemini] add fake_release_chunk for keep-gathered chunk in the inference mode (#2671) by HELSON

Workflow

Autoparallel

  • [autoparallel] Patch meta information of torch.nn.functional.softmax and torch.nn.Softmax (#2674) by Boyuan Yao

Dooc

Full Changelog: v0.2.4...v0.2.3

Version v0.2.3 Release Today!

13 Feb 01:52
81ea66d

What's Changed

Release

Doc

Docs

Autoparallel

  • [autoparallel] Patch meta information of torch.nn.LayerNorm (#2647) by Boyuan Yao

Full Changelog: v0.2.3...v0.2.2

Version v0.2.2 Release Today!

10 Feb 03:02
b673e5f

What's Changed

Release

Workflow

Example

Doc

Autoparallel

Tutorial

Autochunk

  • [autochunk] support diffusion for autochunk (#2621) by oahzxl

Build

Test

Full Changelog: v0.2.2...v0.2.1

Version v0.2.1 Release Today!

06 Feb 13:44
f566b0c

What's Changed

Workflow

  • [workflow] fixed broken release workflows (#2604) by Frank Lee
  • [workflow] added cuda extension build test before release (#2598) by Frank Lee
  • [workflow] hooked pypi release with lark (#2596) by Frank Lee
  • [workflow] hooked docker release with lark (#2594) by Frank Lee
  • [workflow] added test-pypi check before release (#2591) by Frank Lee
  • [workflow] fixed the typo in the example check workflow (#2589) by Frank Lee
  • [workflow] hook compatibility test failure to lark (#2586) by Frank Lee
  • [workflow] hook example test alert with lark (#2585) by Frank Lee
  • [workflow] added notification if scheduled build fails (#2574) by Frank Lee
  • [workflow] added discussion stats to community report (#2572) by Frank Lee
  • [workflow] refactored compatibility test workflow for maintainability (#2560) by Frank Lee
  • [workflow] adjust the GPU memory threshold for scheduled unit test (#2558) by Frank Lee
  • [workflow] fixed example check workflow (#2554) by Frank Lee
  • [workflow] fixed typos in the leaderboard workflow (#2567) by Frank Lee
  • [workflow] added contributor and user-engagement report (#2564) by Frank Lee
  • [workflow] only report coverage for changed files (#2524) by Frank Lee
  • [workflow] fixed the precommit CI (#2525) by Frank Lee
  • [workflow] fixed changed file detection (#2515) by Frank Lee
  • [workflow] fixed the skip condition of example weekly check workflow (#2481) by Frank Lee
  • [workflow] automated bdist wheel build (#2459) by Frank Lee
  • [workflow] automated the compatibility test (#2453) by Frank Lee
  • [workflow] fixed the on-merge condition check (#2452) by Frank Lee
  • [workflow] make test coverage report collapsible (#2436) by Frank Lee
  • [workflow] report test coverage even if below threshold (#2431) by Frank Lee
  • [workflow] auto comment with test coverage report (#2419) by Frank Lee
  • [workflow] auto comment if precommit check fails (#2417) by Frank Lee
  • [workflow] added translation for non-english comments (#2414) by Frank Lee
  • [workflow] added precommit check for code consistency (#2401) by Frank Lee
  • [workflow] refactored the example check workflow (#2411) by Frank Lee
  • [workflow] added nightly release to pypi (#2403) by Frank Lee
  • [workflow] added missing file change detection output (#2387) by Frank Lee
  • [workflow] New version: Create workflow files for examples' auto check (#2298) by ziyuhuang123
  • [workflow] fixed pypi release workflow error (#2328) by Frank Lee
  • [workflow] fixed pypi release workflow error (#2327) by Frank Lee
  • [workflow] added workflow to release to pypi upon version change (#2320) by Frank Lee
  • [workflow] removed unused assign reviewer workflow (#2318) by Frank Lee
  • [workflow] rebuild cuda kernels when kernel-related files change (#2317) by Frank Lee

Release

Doc

Setup

Tutorial

Polish

  • [polish] polish ColoTensor and its submodules (#2537) by HELSON
  • [polish] polish code for get_static_torch_model (#2405) by HELSON

Kernel

Hotfix

Autochunk

  • [autochunk] add benchmark for transformer and alphafold (#2543) by oahzxl
  • [autochunk] support multi outputs chunk search (#2538) by oahzxl
  • [autochunk] support transformer (#2526) by oahzxl
  • [autochunk] support parsing blocks (#2506) by oahzxl
  • [autochunk] support autochunk on evoformer (#2497) by oahzxl
  • [autochunk] support evoformer tracer (#2485) by oahzxl
  • [autochunk] add autochunk feature by Jiarui Fang

Git

Gemini

Example

  • [example] Add fastfold tutorial (#2528) by [LuGY]...

Version v0.2.0 Release Today!

03 Jan 12:29
26e171a

What's Changed

Version

Examples

  • [examples] using args and combining two versions for PaLM (#2284) by ZijianYY
  • [examples] replace einsum with matmul (#2210) by ZijianYY

Doc

Zero

Example

Hotfix

Autoparallel

Gemini

Pipeline middleware

  • [Pipeline Middleware] Reduce comm redundancy by getting accurate output (#2232) by Ziyue Jiang

Builder

Logger

Diffusion

Testing

  • [testing] add beit model for unit testings (#2196) by HELSON

NFC

Exmaple

  • [exmaple] diffuser, support quant inference for stable diffusion (#2186) by BlueRum
  • [exmaple] add vit missing functions (#2154) by Jiarui Fang

Pipeline middleware

  • [Pipeline Middleware] Fix deadlock when num_microbatch=num_stage (#2156) by Ziyue Jiang

Full Changelog: v0.2.0...v0.1.13