You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -53,13 +53,13 @@ python setup.py install
53
53
54
54
The UniMol pretrained models can be found at [dptech/Uni-Mol-Models](https://huggingface.co/dptech/Uni-Mol-Models/tree/main).
55
55
56
-
If the download is slow, you can use other mirrors, such as:
56
+
If the download is slow, you can use a mirror, such as:
57
57
58
58
```bash
59
59
export HF_ENDPOINT=https://hf-mirror.com
60
60
```
61
61
62
-
Setting the `HF_ENDPOINT`environment variable specifies the mirror address for the Hugging Face Hub to use when downloading models.
62
+
By default `unimol_tools` first tries the official Hugging Face endpoint. If that fails and `HF_ENDPOINT`is not set, it automatically retries using `https://hf-mirror.com`. Set `HF_ENDPOINT` yourself if you want to explicitly choose a mirror or the official site.
Copy file name to clipboardExpand all lines: docs/source/installation.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -42,7 +42,7 @@ If the download is slow, you can use other mirrors, such as:
42
42
export HF_ENDPOINT=https://hf-mirror.com
43
43
```
44
44
45
-
Setting the `HF_ENDPOINT`environment variable specifies the mirror address for the Hugging Face Hub to use when downloading models.
45
+
By default `unimol_tools` first tries the official Hugging Face endpoint. If that fails and `HF_ENDPOINT`is not set, it automatically retries using `https://hf-mirror.com`. Set `HF_ENDPOINT`to use a specific endpoint.
Copy file name to clipboardExpand all lines: docs/source/quickstart.md
+6-6Lines changed: 6 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -187,11 +187,11 @@ export MASTER_PORT='19198'
187
187
Currently unimol_tools supports five types of fine-tuning tasks: `classification`, `regression`, `multiclass`, `multilabel_classification`, `multilabel_regression`.
188
188
189
189
The datasets used in the examples are all open source and available, including
190
-
- Ames mutagenicity. The dataset includes 6512 compounds and corresponding binary labels from Ames Mutagenicity results.
191
-
- ESOL (delaney) is a standard regression dataset containing structures and water solubility data for 1128 compounds.
192
-
- Tox21 Data Challenge 2014 is designed to help scientists understand the potential of the chemicals and compounds being tested through the Toxicology in the 21st Century initiative to disrupt biological pathways in ways that may result in toxic effects, which includes 12 date sets. The official web site is https://tripod.nih.gov/tox21/challenge/
193
-
- Solvation free energy (FreeSolv). SMILES are provided.
194
-
- Vector-QM24 (VQM24) dataset. Quantum chemistry dataset of ~836 thousand small organic and inorganic molecules.
190
+
- Ames mutagenicity. The dataset includes 6512 compounds and corresponding binary labels from Ames Mutagenicity results. The dataset is available at https://weilab.math.msu.edu/DataLibrary/2D/.
191
+
- ESOL (delaney) is a standard regression dataset containing structures and water solubility data for 1128 compounds. The dataset is available at https://weilab.math.msu.edu/DataLibrary/2D/ and https://huggingface.co/datasets/HR-machine/ESol.
192
+
- Tox21 Data Challenge 2014 is designed to help scientists understand the potential of the chemicals and compounds being tested through the Toxicology in the 21st Century initiative to disrupt biological pathways in ways that may result in toxic effects, which includes 12 date sets. The official web site is https://tripod.nih.gov/tox21/challenge/. The datasets is available at https://moleculenet.org/datasets-1 and https://www.kaggle.com/datasets/maksiamiogan/tox21-dataset.
193
+
- Solvation free energy (FreeSolv). SMILES are provided. The dataset is available at https://weilab.math.msu.edu/DataLibrary/2D/.
194
+
- Vector-QM24 (VQM24) dataset. Quantum chemistry dataset of ~836 thousand small organic and inorganic molecules. The dataset is available at https://zenodo.org/records/15442257.
195
195
196
196
### Example of classification
197
197
You can use a dictionary as input. The default smiles column name is **'SMILES'** and the target column name is **'target'**. You can also customize it with `smiles_col` and `target_cols`.
Copy file name to clipboardExpand all lines: docs/source/weight.rst
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -22,7 +22,7 @@ If the download is slow, you can use other mirrors, such as:
22
22
23
23
export HF_ENDPOINT=https://hf-mirror.com
24
24
25
-
Setting the ``HF_ENDPOINT`` environment variable specifies the mirror address for the Hugging Face Hub to use when downloading models.
25
+
By default ``unimol_tools`` first tries the official Hugging Face endpoint. If that fails and ``HF_ENDPOINT`` is not set, it automatically retries with ``https://hf-mirror.com``. Set the variable yourself to choose a specific endpoint.
26
26
27
27
`unimol_tools.weights.weight_hub.py <https://github.com/deepmodeling/unimol_tools/blob/main/unimol_tools/weights/weighthub.py>`_ control the logger.
0 commit comments