Skip to content

NAVER-INTEL-Co-Lab/gaudi-datapruning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

36 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

βœ‚οΈ Data Pruning Algorithms for Gaudi Environment

This repository provides an implementation of Difficulty and Uncertainty-Aware Lightweight (DUAL) data pruning, along with other data pruning algorithms particularly suitable for the Gaudi environment.

For more details, check out our paper on arXiv


πŸš€ Usage

Please refer to each folder for dataset-specific experiments:

  • exp_cifar for CIFAR (10, 100) experiments
  • exp_imagenet for ImageNet experiments

πŸ“Š Results

DUAL pruning enables efficient dataset pruning without requiring full training on the original dataset while achieving SOTA performance.

DUAL Performance

  • Left: Test accuracy comparison on CIFAR-10 dataset under different pruning ratios.
  • Right: Test accuracy comparison on CIFAR-100 dataset under different pruning ratios.
  • The color represents the total computation time, including the time spent training the original dataset for score calculation for each pruning method. Blue indicates lower computation time, while red indicates higher computation time. Our method demonstrates its ability to minimize computation time while maintaining SOTA performance.

πŸ“Œ Available Data Pruning Algorithms


βŒ›οΈ Gaudi-v2 Performance Comparison (vs. NVIDIA A6000)

NVIDIA A6000 intel Gaudi-v2 (Lazy)
CIFAR (Full) 37m 14s 32m 14s
ImageNet (Full) 35h 20m 38s 19h 54m 1s

πŸ“š Citation

@article{cho2025lightweight,
  title={Lightweight Dataset Pruning without Full Training via Example Difficulty and Prediction Uncertainty},
  author={Cho, Yeseul and Shin, Baekrok and Kang, Changmin and Yun, Chulhee},
  journal={arXiv preprint arXiv:2502.06905},
  year={2025}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •