Skip to content

Pytorch project accompanying the paper "A Unified Perspective on CTC and SDTW using Differentiable DTW", submitted to IEEE TASLPRO, 2025

License

Notifications You must be signed in to change notification settings

groupmm/dDTW_CTC

Repository files navigation

Accompanying code for:

A Unified Perspective on CTC and SDTW using Differentiable DTW

Johannes Zeitler ([email protected])
International Audio Laboratories Erlangen
December 2025

Overview

This repository contains code to reproduce all experiments in the paper. The experiments are separated into single folders:

  • 01_single_label_MTD/
  • 02_multi_label_PCE/
    • contains the experiment on multi-label classification (polyphonic pitch class estimation)
    • requires the Schubert Winterreise Dataset (SWD), see https://zenodo.org/records/10839767
  • 03_runtime_memory/
    • contains the experiment on runtime and memory consumption
    • does not require additional resources
  • dDTW_toolbox/
    • contains a CUDA-optimized implementation of the dDTW algorithm, programmed exactly as described in the paper
    • the toolbox implementation is used for all experiments

Getting started

If you want to see how dDTW can replace an element-wise loss function or CTC out-of-the-box, take a look at ./01_single_label_MTD/trainScript.py

Notes

To reduce the memory footprint of this repository, we do not include all training datasets. The MTD and SWD need to be acquired separately. Furthermore, we provide only the trained models with the lowest validation loss for each model/loss configuration.

If you find this code useful...

please consider citing our paper

Johannes Zeitler and Meinard Müller, "A Unified Perspective on CTC and SDTW using Differentiable DTW", submitted to IEEE Transactions on Audio, Speech, and Language Processing, 2025.

About

Pytorch project accompanying the paper "A Unified Perspective on CTC and SDTW using Differentiable DTW", submitted to IEEE TASLPRO, 2025

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published