This repository serves as the central hub for a machine learning potentials (MLP) project, integrating customized versions of MACE and ASE along with a comprehensive suite of utility scripts for data generation, training, uncertainty analysis (global, local), and active learning. The atomic structures used for training are also included as .xyz files for transparency and reproducibility.
This repository includes two modified forks as Git submodules:
external/mace_nodecustom version of MACEexternal/ase_multiproc_calcmodified ASE with multiprocessing calculator support
The scripts/ folder organizes all supporting code used throughout the MLP workflow:
data_generationstructure files used in model initialization and benchmarkingmlp_trainingtraining and analysis of machine learning potentialsglobal_uncertaintyanalysis of uncertainty in ensemble predicted total energyuniqueness_analysiscluster analysis of atomic environments in structured datasetlocal_uncertaintyInvestigations of uncertainty in atomic energy and forces, and associated structural analysisactive_learningretraining based on local energy uncertainty sampling
The xyz_files directory contains the datasets used for training in extended XYZ format. These include the initial seeds and active learning configurations.
The examples/ directory contains usage examples of ensemble-based simulations and uncertainty-driven sampling:
ASE_multiproc_calc: Demonstrates a multiprocessing-enabled ASE interface for evaluating ANN ensemble models (e.g., aenet) on CPUs, enabling fast MD with uncertainty estimation.MACE_energybias: Shows how to perform uncertainty-biased simulations using the MACE ASE calculator by modifying the potential energy with an ensemble uncertainty-based Gaussian bias.