DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer

This repository contains the official implementation of the paper DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer.

[paper] [CVPR paper]

Installation

NMS

cd util
python setup.py install --user # build NMS
cd ..
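For readers unfamiliar with what the compiled NMS extension is for, the following is a minimal pure-Python sketch of greedy non-maximum suppression over 1D temporal segments. It is an illustration only, not the code in util: the function names, the hard (non-soft) suppression rule, and the 0.5 IoU threshold are all assumptions made for this example.

```python
def temporal_iou(a, b):
    """IoU of two 1D segments, each given as (start, end)."""
    inter = max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union > 0 else 0.0


def temporal_nms(segments, scores, iou_threshold=0.5):
    """Greedy hard NMS (hypothetical helper).

    Returns the indices of the kept segments, highest score first.
    """
    order = sorted(range(len(segments)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        # Keep a segment only if it does not overlap too much with any kept one.
        if all(temporal_iou(segments[i], segments[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


# Two heavily overlapping detections and one separate detection.
segments = [(0.0, 10.0), (1.0, 11.0), (20.0, 30.0)]
scores = [0.9, 0.8, 0.7]
kept = temporal_nms(segments, scores)
```

In this toy input, the second segment overlaps the first with an IoU of 9/11, so it is suppressed and only the first and third segments remain.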

Temporal Deformable Attention

cd models/digit/ops
python setup.py build install
cd ../../..

Prepare Dataset

We follow the ActionFormer repository and Video Mamba Suite to prepare the datasets: THUMOS14, ActivityNet v1.3, and HACS-Segment.

Use scripts/make_feature_info.py to generate the feature information for each dataset. The feature information for THUMOS14 is already included in the repository.

Training

To train the DiGIT model on the THUMOS14 dataset, execute the following command:

python main.py --c config/digit/internvideo2/thumos14.py --output_dir logs/thumos14

Evaluation

To evaluate the trained model and obtain performance metrics, run the following command:

python main.py --eval --c config/digit/internvideo2/thumos14.py --output_dir logs/thumos14
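The training and evaluation commands above differ only in the --eval flag, so they can be scripted together. The sketch below is one possible wrapper, assuming it is run from the repository root and that evaluation reads the checkpoint from the same output directory (which the commands suggest but do not state). The helper names build_command and train_then_eval are hypothetical and not part of the repository.

```python
import subprocess
import sys


def build_command(config, output_dir, evaluate=False):
    """Assemble the main.py command line shown in this README (hypothetical helper)."""
    cmd = [sys.executable, "main.py"]
    if evaluate:
        cmd.append("--eval")
    cmd += ["--c", config, "--output_dir", output_dir]
    return cmd


def train_then_eval(config, output_dir):
    """Run training first, then evaluation; stop if either step fails."""
    for evaluate in (False, True):
        subprocess.run(build_command(config, output_dir, evaluate), check=True)


# Example call, left commented out so importing this file does not start a run:
# train_then_eval("config/digit/internvideo2/thumos14.py", "logs/thumos14")
```

Passing check=True makes a failed training run raise an error instead of silently continuing to evaluation.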

Citation

If you find our work helpful, please consider citing our paper:

@InProceedings{Kim_2025_CVPR,
    author    = {Kim, Ho-Joong and Lee, Yearang and Hong, Jung-Ho and Lee, Seong-Whan},
    title     = {DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer},
    booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR)},
    month     = {June},
    year      = {2025},
    pages     = {24286-24296}
}
