You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
*DRFP* can be installed from pypi using `pip install drfp`. However, it depends on [RDKit](https://www.rdkit.org/) which is best [installed using conda](https://www.rdkit.org/docs/Install.html).
28
+
*DRFP* can be installed from pypi using `pip install drfp`.
29
29
30
30
Once DRFP is installed, there are two ways you can use it. You can use the cli app `drfp` or the library provided by the package.
31
31
@@ -56,18 +56,18 @@ The variable `fps` now points to a list containing the fingerprints for the two
56
56
57
57
The library contains the class `DrfpEncoder` with one public method `encode`.
58
58
59
-
|`DrfpEncoder.encode()`| Description | Type | Default |
60
-
|-|-|-|-|
61
-
|`X`| An iterable (e.g. a list) of reaction SMILES or a single reaction SMILES to be encoded |`Iterable` or `str`||
62
-
|`n_folded_length`| The folded length of the fingerprint (the parameter for the modulo hashing) |`int`|`2048`|
63
-
|`min_radius`| The minimum radius of a substructure (0 includes single atoms) |`int`|`0`|
64
-
|`radius`| The maximum radius of a substructure |`int`|`3`|
65
-
|`rings`| Whether to include full rings as substructures |`bool`|`True`|
66
-
|`mapping`|Return a feature to substructure mapping in addition to the fingerprints. If true, the return signature of this method is `Tuple[List[np.ndarray], Dict[int, Set[str]]]`|`bool`|`False`|
67
-
|`atom_index_mapping`| Return the atom indices of mapped substructures for each reaction |`bool`|`False`|
68
-
|`root_central_atom`| Whether to root the central atom of substructures when generating SMILES |`bool`|`True`|
69
-
|`include_hydrogens`| Whether to explicitly include hydrogens in the molecular graph |`bool`|`False`|
70
-
|`show_progress_bar`| Whether to show a progress bar when encoding reactions |`bool`|`False`|
|`X`| An iterable (e.g. a list) of reaction SMILES or a single reaction SMILES to be encoded |`Iterable` or `str`||
62
+
|`n_folded_length`| The folded length of the fingerprint (the parameter for the modulo hashing) |`int`|`2048`|
63
+
|`min_radius`| The minimum radius of a substructure (0 includes single atoms) |`int`|`0`|
64
+
|`radius`| The maximum radius of a substructure |`int`|`3`|
65
+
|`rings`| Whether to include full rings as substructures |`bool`|`True`|
66
+
|`mapping`|Return a feature to substructure mapping in addition to the fingerprints. If true, the return signature of this method is `Tuple[List[np.ndarray], Dict[int, Set[str]]]`|`bool`|`False`|
67
+
|`atom_index_mapping`| Return the atom indices of mapped substructures for each reaction |`bool`|`False`|
68
+
|`root_central_atom`| Whether to root the central atom of substructures when generating SMILES |`bool`|`True`|
69
+
|`include_hydrogens`| Whether to explicitly include hydrogens in the molecular graph |`bool`|`False`|
70
+
|`show_progress_bar`| Whether to show a progress bar when encoding reactions |`bool`|`False`|
71
71
72
72
# Reproduce
73
73
Want to reproduce the results in our paper? You can find all the data in the `data` folder and encoding and training scripts in the `scripts` folder.
@@ -82,3 +82,25 @@ Want to reproduce the results in our paper? You can find all the data in the `da
82
82
publisher={Royal Society of Chemistry}
83
83
}
84
84
```
85
+
86
+
## Development Setup
87
+
88
+
This project uses [UV](https://github.com/astral-sh/uv) for dependency management. To set up a development environment:
89
+
90
+
1. Install UV following the [official instructions](https://github.com/astral-sh/uv#installation)
91
+
92
+
2. Clone the repository:
93
+
```bash
94
+
git clone https://github.com/reymond-group/drfp
95
+
cd drfp
96
+
```
97
+
98
+
3. Install dependencies including development packages:
0 commit comments