You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This PR introduces lightweight pretraining functionality to Unimol_tools and adds Hydra-based command-line interfaces. It includes a complete pretraining pipeline with masking, loss functions, metrics aggregation, and distributed training support. The PR also updates the CLI tools with configuration management and adds pretrained model path parameters.
Adds comprehensive pretraining infrastructure including model architecture, loss functions, dataset handling, and trainer implementation
Integrates Hydra configuration management for command-line tools
Introduces support for custom pretrained model and dictionary paths across training, representation, and prediction workflows
@@ -19,7 +19,6 @@ Unimol_tools is a easy-use wrappers for property prediction,representation and d
19
19
20
20
## Install
21
21
- pytorch is required, please install pytorch according to your environment. if you are using cuda, please install pytorch with cuda. More details can be found at https://pytorch.org/get-started/locally/
22
-
- currently, rdkit needs with numpy<2.0.0, please install rdkit with numpy<2.0.0.
23
22
24
23
### Option 1: Installing from PyPi (Recommended, for stable version)
25
24
@@ -38,13 +37,13 @@ pip install huggingface_hub
38
37
### Option 2: Installing from source (for latest version)
[`pretrain_config.py`](unimol_tools/pretrain/pretrain_config.py), and checkpoints
239
+
along with the dictionary are saved to the run directory. When GPU memory is
240
+
limited, increase `training.update_freq` to accumulate gradients while keeping
241
+
the effective batch size `n_gpu * training.batch_size * training.update_freq`.
242
+
113
243
## Credits
114
244
We thanks all contributors from the community for their suggestions, bug reports and chemistry advices. Currently unimol-tools is maintained by Yaning Cui, Xiaohong Ji, Zhifeng Gao from DP Technology and AI for Science Insitution, Beijing.
0 commit comments