Skip to content

v0.8.0

Choose a tag to compare

@tiankongdeguiji tiankongdeguiji released this 13 Jun 13:05
· 117 commits to master since this release
b1824f7

Major Features and Improvements

Train/Eval/Export

  • Support eval and save checkpoint by epoch #116
  • Support export fp32/fp16/int8/int4/int2 ebc embedding quant model #137
  • Enhance export efficiency by restoring state dict directly instead of copying and gathering #177
  • Add faiss gpu support for evaluation #170
  • Enhance optimizer state loading for changed plans with plan checkpoint #185
  • Support tensorboard log for model parameters #181
  • Add restore ckpt check for continue train #180
  • Add allow_tf32 flag and global embedding param constraint #188

Model

  • Add MIND model #119 #123 #157 #172
  • Add RocketLaunching model #129
  • Add DLRM model #148
  • Add MaskNet #179 #187
  • Add dice activation and support bn for sequence mlp #107
  • Add regression and multi-classification metric #149
  • Optimize distributed GAUC memory use #127
  • Add SequenceEmbeddingGroup and support jagged forward #152
  • Support max sequence length setting for sequence encoder #184
  • Support hard negative sampler #195
  • Optimize HSTU training and sampling process and add triton ops (WIP) #93 #154

Feature

  • Support custom feature and custom sequence feature #144
  • Weighted id feature support map dtype #190
  • Dump parsed inputs support weighted id and multi-value sequence feature #191

Dataset

  • Support dataset shuffle #114
  • Optimize performance of ParquetDataset and Rebalance parquet files dynamically #125 #126
  • Add odps read session refresh to extend odps session expired time #132
  • Add more alibaba cloud credentials for odps dataset #115
  • Add odps_data_compression (ZSTD) config for OdpsDataset #146
  • Always lazy init odps writer #178

Upgrade

  • Upgrade pytorch to v2.7 and torchrec to v1.2.0 #197

Note

For TorchEasyRec 0.8.x, you should use Docker image version 0.8.

  • For the GPU version (CUDA 12.6):
    • mybigpai-public-registry.cn-beijing.cr.aliyuncs.com/easyrec/tzrec-devel:0.8-cu126
    • PyTorch: v2.7 CUDA: v12.6 FBGEMM: v1.2.0 TorchRec: v1.2.0 Python: v3.11
    • We drop support for the 470 GPU driver version. If you still want to use the 470 GPU driver version, you can set LD_LIBRARY_PATH=/usr/local/cuda-12.6/compat
  • For the CPU version:
    • mybigpai-public-registry.cn-beijing.cr.aliyuncs.com/easyrec/tzrec-devel:0.8-cpu
    • PyTorch: v2.7 FBGEMM: v1.2.0 TorchRec: v1.2.0 Python: v3.11

Bug Fixes and Other Changes

Full Changelog: v0.7.0...v0.8.0