Skip to content

【Hackathon 10th Spring No.14】SFIN模型复现#244

Open
ADream-ki wants to merge 11 commits intoPaddlePaddle:developfrom
ADream-ki:sfin
Open

【Hackathon 10th Spring No.14】SFIN模型复现#244
ADream-ki wants to merge 11 commits intoPaddlePaddle:developfrom
ADream-ki:sfin

Conversation

@ADream-ki
Copy link

No description provided.

@paddle-bot
Copy link

paddle-bot bot commented Feb 11, 2026

Thanks for your contribution!

@paddle-bot paddle-bot bot added the contributor External developers label Feb 11, 2026
@leeleolay
Copy link
Collaborator

辛苦提供数据集链接和模型预训练权重,并且补充相应的的readme文件

@ADream-ki
Copy link
Author

sfin模型并没有预训练权重,数据集应该是放到百度网盘吧?

@ADream-ki
Copy link
Author

https://pan.baidu.com/s/11uT6O_SjJQ0tpk7U6IER1g?pwd=tbvc
这个链接里面是对应的数据、最终训练好的权重

@leeleolay
Copy link
Collaborator

建议这个任务修改为Spectrum Enhancement(SE)

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

辛苦使用工厂函数的方式来加载数据集,另外数据集可以参考其他的数据集的格式,压缩为tar.gz格式,并且添加支持自动下载的功能,数据集可以分为训练,验证,测试集,在train.py和predict.py里面对应起来相应的流程

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

我咨询了论文的原作者,他提供了数据集以及训练代码

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

https://pan.baidu.com/s/1qD4qUJmCLg_hy_P6YIqiaw?pwd=u183
这是训练数据,能否上传后提供对应的链接?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

dataset这个文件还是没有使用build工厂函数的方式,可以参考mp20 dataset的实现逻辑,数据集是支持自动下载的,辛苦按照原先的逻辑实现

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

是现在这样吗?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

evaluate.py这个文件的作用只是验证模型在验证集的效果过?如果是这样的话建议和train.py合并,并且只保留predict.py

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这里的链接在修改名字后记得辛苦更新下

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

已经修改

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

参考其他模型的readme页面,需要添加模型的结果,并且在该页面附上模型链接,除了预训练模型权重,还有log文件,建议下载其他的模型看一下格式

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sfin模型没有预训练权重文件,这个时候上传log就好了吧?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@leeleolay
Copy link
Collaborator

leeleolay commented Feb 23, 2026

预训练模型权重链接:https://paddle-org.bj.bcebos.com/paddlematerials/checkpoints/spectrum_enhancement/sfin/sfin_he_500.pdparams
,建议增加log文件。可以把原先的公开的torch格式的预训练模型权重转化为paddle的并给出链接

@luotao1 luotao1 changed the title 【Hackathon_10th】NO.14 SFIN模型复现 【Hackathon 10th Spring No.14】SFIN模型复现 Feb 26, 2026
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

dataset这个文件还是没有使用build工厂函数的方式,可以参考mp20 dataset的实现逻辑,数据集是支持自动下载的,辛苦按照原先的逻辑实现

__class_name__: STEMImageDataset
__init_params__:
data_path: "./bf_data"
url: "https://paddle-org.bj.bcebos.com/paddlematerials/datasets/SFIN_datasets/bf_data.zip"
Copy link
Collaborator

@leeleolay leeleolay Feb 27, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

数据集是支持后台下载的方式,辛苦不显示指定,参考mp20数据集的方式,在dataset.py里可以声明链接

README.md Outdated
## 📑 Task
- [MLIP-Machine Learning Interatomic Potential](interatomic_potentials/README.md)
- [MLES-Machine Learning Electronic Structure](electronic_structure/README.md)
- [CM-Crystal Materials](crystal_materials/README.md)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这里辛苦修改

| **Datasets** | |
| HAADF/BF paired `noisy` / `gt_enhance` / `gt_detect` datasets | ✅ |

## 3.Configurations
Copy link
Collaborator

@leeleolay leeleolay Feb 27, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

任务介绍readme里面辛苦删掉config的部分

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

已经修改了


Spectrum Enhancement (SE) focuses on enhancing and denoising spectral and microscopy data for crystalline materials. Leveraging advanced deep learning techniques, SE aims to recover high-quality signals from noisy observations, enabling more accurate analysis of material properties at the atomic scale. This task is particularly valuable for STEM (Scanning Transmission Electron Microscopy) image processing, where noise reduction can significantly improve the visualization of crystal structures and defects.

Current SFIN cases support:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

和模型相关的内容移动到模型readme里


## 1.Introduction

Spectrum Enhancement (SE) focuses on enhancing and denoising spectral and microscopy data for crystalline materials. Leveraging advanced deep learning techniques, SE aims to recover high-quality signals from noisy observations, enabling more accurate analysis of material properties at the atomic scale. This task is particularly valuable for STEM (Scanning Transmission Electron Microscopy) image processing, where noise reduction can significantly improve the visualization of crystal structures and defects.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

【for crystalline materials】这句话删掉吧

| Distributed training | ✅ |
| Mixed precision (AMP) | — |
| Fine-tuning | ✅ |
| **ML Capabilities · Evaluation** | |
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这里的ML Capabilityies更多是指框架工程框架能力,framework support,PSNR和SSIM放到这里不太合适

@ADream-ki
Copy link
Author

@leeleolay 已经按照要求修改了,请review

@leeleolay
Copy link
Collaborator

权利声明辛苦修改为2026年,dataset的部分还没有修改,辛苦再看下review意见 @ADream-ki

@ADream-ki
Copy link
Author

权利声明辛苦修改为2026年,dataset的部分还没有修改,辛苦再看下review意见 @ADream-ki

好的,已经修改了

@ADream-ki
Copy link
Author

dataset部分我刚刚修改好了 @leeleolay

@leeleolay
Copy link
Collaborator

辛苦提供一下和torch精度对齐的数据

@ADream-ki
Copy link
Author

辛苦提供一下和torch精度对齐的数据

image

@ADream-ki
Copy link
Author

@leeleolay

@leeleolay
Copy link
Collaborator

image 辛苦检查一下推理命令,另外辛苦提供两个数据集的两个任务的训练结果

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

已经有一个build_spectrum 的工厂函数,建议考虑下是否可以复用这个或者做个兼容性升级

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这个里面的模块是否可以借鉴仓库里已有的功能模块来服用。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

contributor External developers

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants