GitHub - BilodeauGroup/PepFoundry

______________________________________________________________________

PepFoundry is a Python package designed to streamline peptide modeling beyond natural amino acids and linear topologies. It enables the incorporation of synthetic (non-canonical) amino acids and produces both RDKit molecule objects and peptide graphs, facilitating their use in machine learning applications.

In addition, PepFoundry supports the generation of cyclic peptides. These peptides are also represented as RDKit molecule objects and graphs, making them suitable for advanced computational analysis and ML workflows.

New Updates

Dec.01/2025, Version 1.1.1: We have added a new method, get_amino_acids, which returns a list of RDKit molecule objects, each representing a single amino acid. Usage examples can be found in examples_PepFoundry.ipynb.
Nov.26/2025, Version 1.1.0: We have added a new method get_smiles_chuckles_format that automatically converts peptide SMILES into CHUCKLES format, including mapping numbers for the terminal residues. This update introduces a new dependency, openbabel. Usage and examples of this method can be found in examples_CHUCKLES.ipynb.

1. Installation Guide

1.1. Creating an Environment with PepFoundry

To automatically create the environment with all required packages, download the file setup_pepfoundry.sh and run the following command:

bash setup_pepfoundry.sh

1.2. Creating an Anaconda Environment Manually

Alternatively, you can create an Anaconda environment manually by running the following commands in the terminal:

1.2.1. Creating the Environment

conda create --name pepfoundry python=3.7.16

1.2.2. Activating the Environment

conda activate pepfoundry

1.2.3. Installing Dependencies

pip install rdkit

If you have a CUDA-compatible GPU:

pip install torch==1.13.1+cu117 torchvision==0.14.1+cu117 -f https://download.pytorch.org/whl/torch_stable.html

Otherwise (CPU only):

pip install torch==1.13.1 torchvision==0.14.1 -f https://download.pytorch.org/whl/torch_stable.html

pip install openpyxl

pip install scikit-learn

pip install ipykernel

pip install pandas

pip install openbabel-wheel

1.2.4. Installing PepFoundry from GitHub

pip install git+https://github.com/BilodeauGroup/PepFoundry.git
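
After the installation steps above, it can help to confirm that each package is visible to the active environment. The following is a minimal sketch, not part of PepFoundry: the helper name check_dependencies and the list REQUIRED_MODULES are hypothetical, and the import names (for example sklearn for scikit-learn and openbabel for openbabel-wheel) are the usual ones for these packages.

```python
from importlib.util import find_spec

# Import names assumed for the packages installed in sections 1.2.3 and 1.2.4.
REQUIRED_MODULES = [
    "rdkit",
    "torch",
    "torchvision",
    "openpyxl",
    "sklearn",      # installed as scikit-learn
    "ipykernel",
    "pandas",
    "openbabel",    # installed as openbabel-wheel
    "pepfoundry",
]


def check_dependencies(modules=REQUIRED_MODULES):
    """Return a dict mapping each module name to True if it can be located."""
    return {name: find_spec(name) is not None for name in modules}


if __name__ == "__main__":
    for name, found in check_dependencies().items():
        print(f"{name:12s} {'ok' if found else 'MISSING'}")
```

Any module reported as MISSING can be installed again with the corresponding pip command from section 1.2.3 or 1.2.4.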

2. Usage

Once installed, you can import and use the package in your Python scripts:

from pepfoundry.interface import PepFoundry

2.1. PepFoundry Class

PepFoundry is the central interface for building peptide RDKit Mol objects. It combines the functionalities of peptide construction and amino acid processing through internal modules.

Before using it, you need to create an instance of the class:

pepfoundry = PepFoundry()

By default, the class uses the standard amino acid database.

Default:
Loads the standard amino acid database included with the package (amino_acids_library).
Custom Database:
Optionally, you can provide a custom amino acid database for each class instance by passing the path to an Excel file:

pepfoundry = PepFoundry(custom_dict_path="path/to/custom_amino_acids.xlsx")

Important: The Excel file should adhere to the format and conventions defined in the default database, with amino acids defined in the CHUCKLES format, including Map Numbers. Following this structure ensures that the peptide builder can correctly interpret the amino acids and construct molecules without errors.
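
Before passing a custom database to the constructor, a quick check of the path can catch simple mistakes early. This is only a sketch under stated assumptions: validate_custom_db_path is a hypothetical helper that is not part of PepFoundry, and it assumes the file uses the .xlsx extension shown in the example above. It does not check the CHUCKLES format or the Map Numbers inside the file.

```python
from pathlib import Path


def validate_custom_db_path(path):
    """Return the path as a Path object if it points to an existing .xlsx file.

    Hypothetical pre-check; the contents of the file are not inspected.
    """
    p = Path(path)
    if not p.is_file():
        raise FileNotFoundError(f"Custom amino acid database not found: {p}")
    if p.suffix.lower() != ".xlsx":
        # Assumption: the custom database is an .xlsx workbook, as in the example.
        raise ValueError(f"Expected an .xlsx file, got: {p.name}")
    return p
```

The validated path can then be passed on, for example as PepFoundry(custom_dict_path=str(validate_custom_db_path("path/to/custom_amino_acids.xlsx"))).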

2.2. Amino Acid Convention

Canonical Amino Acids:
- L-amino acids are represented with uppercase letters (e.g., A for L-Alanine).
- D-amino acids are represented with lowercase letters (e.g., a for D-Alanine).
Non-canonical amino acids are enclosed in curly braces, e.g., {Xyz}.
Modifications such as acetylation and amidation are also enclosed in {}, e.g.:
- {ac} for acetylation
- {am} for amidation
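
The convention above can be illustrated with a small tokenizer. This is a sketch for illustration only, not PepFoundry's own parser: the function tokenize_peptide is hypothetical, and it assumes that residues are written back to back in a single string with no separators, which the convention does not state explicitly.

```python
import re

# A token is either a braced name such as {Xyz} or {ac}, or a single letter.
TOKEN_RE = re.compile(r"\{([^{}]+)\}|([A-Za-z])")


def tokenize_peptide(sequence):
    """Split a sequence string into (kind, name) tuples.

    kind is "L" for uppercase letters, "D" for lowercase letters, and
    "braced" for anything in curly braces (non-canonical residues or
    modifications such as {ac} and {am}).
    """
    tokens = []
    pos = 0
    while pos < len(sequence):
        match = TOKEN_RE.match(sequence, pos)
        if match is None:
            raise ValueError(f"Unexpected character at position {pos}: {sequence[pos]!r}")
        if match.group(1) is not None:
            tokens.append(("braced", match.group(1)))
        elif match.group(2).isupper():
            tokens.append(("L", match.group(2)))
        else:
            tokens.append(("D", match.group(2)))
        pos = match.end()
    return tokens


# Acetylated N-terminus, L-Ala, D-Ala, a non-canonical residue, L-Lys, amidated C-terminus.
print(tokenize_peptide("{ac}Aa{Xyz}K{am}"))
```

Distinguishing a braced modification from a braced non-canonical residue would require a lookup in the amino acid database, which this sketch does not attempt.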

3. Examples:

3.1. PepFoundry Implementation

Full usage examples are provided in:

examples_PepFoundry

3.2. CHUCKLES Construction

SMILES construction or rewriting (CHUCKLES format):
Examples of how to construct or rewrite SMILES for amino acids in CHUCKLES format are provided in:

examples_CHUCKLES.ipynb

3.3. ML Implementation

Examples of how PepFoundry can be used in ML applications are provided in:

ML example

4. Cite

Garzon Otero, D.; Akbari, O.; Mandapati, A.; Bilodeau, C. PepFoundry: A Pipeline for Building Machine-Learning Ready Representations of Nonstandard Peptides Containing Cycles, Non-natural Residues, Polymer Units, and More. J. Chem. Inf. Model. ASAP. https://doi.org/10.1021/acs.jcim.5c02629

