Add LSTMDoubleFit model for low-dimensional perovskite design#205
Open
Wei-jie-Wu wants to merge 274 commits intoPaddlePaddle:developfrom
Open
Add LSTMDoubleFit model for low-dimensional perovskite design#205Wei-jie-Wu wants to merge 274 commits intoPaddlePaddle:developfrom
Wei-jie-Wu wants to merge 274 commits intoPaddlePaddle:developfrom
Conversation
update code from origin repo
move data_utils.py to dataset/utils.py delete useless code
fix bug
* fix: fix chgnet model download link * fix: set nan to 0
* feat: add task readme * fix error * update logo
* fix: update reshape * fix: fix
* feat: add task readme * fix error * update logo * Add files via upload * Update README.md * Add files via upload * Update README.md
* feat: add task readme * fix error * update logo * Add files via upload * Update README.md * Add files via upload * Update README.md * Add files via upload * Update README.md * Add files via upload * Update README.md * Delete docs/paddlematerial_overview_en.png * Delete docs/paddlematerial_overview_ch.png
* feat: add task readme * fix error * update logo * Add files via upload * Update README.md * Add files via upload * Update README.md * Add files via upload * Update README.md * Add files via upload * Update README.md * Delete docs/paddlematerial_overview_en.png * Delete docs/paddlematerial_overview_ch.png * Delete docs/logo_ppmat.png * Delete docs/ppmat_overview_en.png * Add files via upload * Update README.md * Update README.md * Update README.md * fix conflict
* feat: add task readme * fix error * update logo * Add files via upload * Update README.md * Add files via upload * Update README.md * Add files via upload * Update README.md * Add files via upload * Update README.md * Delete docs/paddlematerial_overview_en.png * Delete docs/paddlematerial_overview_ch.png * Delete docs/logo_ppmat.png * Delete docs/ppmat_overview_en.png * Add files via upload * Update README.md * Update README.md * Update README.md * fix conflict * fix words error
* Update README.md * Update README.md
* matbench_dataset * 训练文件 * Delete megnet_matbench_bulk_modulus_t_20250731_041800_s_42 directory * Delete megnet_matbench_shear_modulus_t_20250731_041740_s_42 directory * matbench数据集适配 * 修改PR * jarvis数据集适配 * megnet_readme修改 * 修改requirements,修改jarvis_dataset
* add DiffNMR * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs of diffprior * fix bug * fix bugs
…set name=alex_mp_20 for mattergen training with alex_mp20 dataset. (PaddlePaddle#200) * fix diffnmr model and config. * fix AlexMP20MatterGenDataset name=alex_mp_20 for mattergen training with alex_mp20 dataset.
|
Thanks for your contribution! |
Collaborator
|
Thanks for your contribution! |
leeleolay
reviewed
Dec 7, 2025
Collaborator
leeleolay
left a comment
There was a problem hiding this comment.
please revise this PR
leeleolay
requested changes
Dec 7, 2025
Collaborator
leeleolay
left a comment
There was a problem hiding this comment.
please revise this PR
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
✅ Description
📘 Overview
This PR contributes the Feature-Guided Inverse Design (LSTMDoubleFit) model for the inverse design of organic A-site cations in low-dimensional perovskites.
The project integrates descriptor calculation, LSTM-based generative learning, and feature-constrained molecular optimization into a unified Paddle-based workflow.
This work reproduces and extends the study:
🧠 Model Workflow
Descriptor Calculation (
Cal.py)Modeldata/.Dataset Preparation
Modeldata/directory into a single dataset:Modeldata.csvwill serve as the unified training dataset.Model Training and Generation (
Best_Seq2seq.py)Feature-Guided DoubleFit Model (
MolecularDoubleFitting.py)Postprocessing
📁 Directory Structure
project/
└── Feature-Guided Inverse Design of LDPs/
├── Best_Seq2seq.py # Main LSTM model: training & molecular generation
├── Cal_ATSC1pe_MATS2c.py # Descriptor calculator (ATSC1pe, MATS2c)
├── Cal_SlogP_VSA2.py # Descriptor calculator (SlogP_VSA2)
├── MolecularDoubleFitting.py # Feature-guided molecular fitting model
├── MSEcalculation.py # Evaluation metrics
├── ModelandDataAnalysis.py # Dataset statistics & analysis
├── Modeldata/ # Folder containing split CSV datasets
├── GreatMolecular.xlsx # High-quality generated molecules
├── NewMolecules.xlsx # Newly generated candidates
├── README.md # Project documentation
└── data_parts/ # (Optional) Split dataset parts (<100 MB each)
⚙️ How to Run
1. Environment