`tensor-core-ntt`

This repository provides an improved implementation of the number theoretic transform (NTT) that leverages Tensor Cores on NVIDIA GPUs.
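Matrix units such as Tensor Cores multiply small low-precision matrices, so an NTT has to be phrased as a matrix product over narrow integers before it can use them. The following is a minimal sketch of that idea in plain Python, not the kernel in this repository. The modulus, the 8-bit limb width, and the names `split_limbs` and `ntt_matmul_limbs` are illustrative assumptions.

```python
MOD = 998244353  # assumed NTT-friendly prime (119 * 2**23 + 1)
ROOT = 3         # a primitive root modulo MOD
LIMBS = 4        # four 8-bit limbs cover values below 2**32


def split_limbs(v):
    """Split a non-negative integer below 2**32 into 8-bit limbs, low limb first."""
    return [(v >> (8 * k)) & 0xFF for k in range(LIMBS)]


def ntt_matmul_limbs(x):
    """Compute the length-n NTT as W @ x, using only 8-bit x 8-bit products."""
    n = len(x)
    w = pow(ROOT, (MOD - 1) // n, MOD)  # primitive n-th root of unity
    W = [[pow(w, i * j, MOD) for j in range(n)] for i in range(n)]
    W_limbs = [[split_limbs(e) for e in row] for row in W]
    x_limbs = [split_limbs(e) for e in x]
    out = []
    for i in range(n):
        acc = 0
        for p in range(LIMBS):
            for q in range(LIMBS):
                # One low-precision dot product: the kind of work a matrix unit does.
                partial = sum(W_limbs[i][j][p] * x_limbs[j][q] for j in range(n))
                # Recombine the partial results by their limb positions.
                acc += partial << (8 * (p + q))
        out.append(acc % MOD)
    return out
```

Each output needs 16 limb-by-limb dot products in this naive form, followed by a shift-and-add recombination and a modular reduction.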

The NTT is a generalization of the fast Fourier transform (FFT) to modular integers. It performs convolutions and polynomial or multiple-precision arithmetic exactly, whereas the FFT uses floating-point arithmetic and therefore suffers from rounding errors.
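To illustrate the exactness claim, here is a small CPU-only sketch of an NTT-based convolution in Python. It is unrelated to the GPU code in this repository. The modulus 998244353, the primitive root 3, and the function names are assumptions chosen for the example.

```python
MOD = 998244353  # assumed NTT-friendly prime (119 * 2**23 + 1)
ROOT = 3         # a primitive root modulo MOD


def ntt(a, invert=False):
    """Recursive radix-2 NTT; len(a) must be a power of two."""
    n = len(a)
    if n == 1:
        return list(a)
    w_n = pow(ROOT, (MOD - 1) // n, MOD)  # primitive n-th root of unity
    if invert:
        w_n = pow(w_n, MOD - 2, MOD)      # use the inverse root instead
    even = ntt(a[0::2], invert)
    odd = ntt(a[1::2], invert)
    out = [0] * n
    w = 1
    for k in range(n // 2):
        t = w * odd[k] % MOD
        out[k] = (even[k] + t) % MOD
        out[k + n // 2] = (even[k] - t) % MOD
        w = w * w_n % MOD
    return out


def intt(a):
    """Inverse NTT: transform with the inverse root, then scale by 1/n."""
    n_inv = pow(len(a), MOD - 2, MOD)
    return [v * n_inv % MOD for v in ntt(a, invert=True)]


def convolve(f, g):
    """Linear convolution of two integer sequences, reduced modulo MOD."""
    size = len(f) + len(g) - 1
    n = 1
    while n < size:
        n *= 2
    fa = ntt(list(f) + [0] * (n - len(f)))
    ga = ntt(list(g) + [0] * (n - len(g)))
    prod = [u * v % MOD for u, v in zip(fa, ga)]
    return intt(prod)[:size]
```

For example, `convolve([1, 2, 3], [4, 5, 6])` gives `[4, 13, 28, 27, 18]`, the coefficients of (1 + 2x + 3x^2)(4 + 5x + 6x^2). The result is the true integer convolution as long as every output coefficient is smaller than `MOD`.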

Through careful analysis, we removed the redundant operations found in previous Tensor Core-based NTT implementations. Combined with an efficient modular reduction algorithm, this yields higher performance than the previous work in most settings.
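This README does not say which modular reduction algorithm is used. As a general illustration of what such an algorithm does, the sketch below shows Montgomery reduction, one widely used technique that replaces division by the modulus with shifts and multiplications. The modulus, the choice R = 2^32, and the function names are assumptions for the example, and the code needs Python 3.8 or later for `pow(MOD, -1, R)`.

```python
MOD = 998244353            # assumed odd modulus below 2**32
R_BITS = 32
R = 1 << R_BITS            # Montgomery radix, coprime to MOD
MASK = R - 1
MOD_NEG_INV = (-pow(MOD, -1, R)) % R  # -MOD^{-1} mod R
R2 = R * R % MOD           # R^2 mod MOD, used to enter Montgomery form


def redc(t):
    """Montgomery reduction: return t * R^{-1} mod MOD for 0 <= t < MOD * R."""
    m = ((t & MASK) * MOD_NEG_INV) & MASK  # makes t + m * MOD divisible by R
    u = (t + m * MOD) >> R_BITS            # exact division by R
    return u - MOD if u >= MOD else u      # one conditional subtraction


def to_mont(x):
    """Map x (0 <= x < MOD) to its Montgomery form x * R mod MOD."""
    return redc(x * R2)


def from_mont(x):
    """Map a Montgomery-form value back to the ordinary residue."""
    return redc(x)


def mont_mul(a, b):
    """Multiply two Montgomery-form values; the result stays in Montgomery form."""
    return redc(a * b)
```

A modular product is then `from_mont(mont_mul(to_mont(a), to_mont(b)))`. In practice the conversions are done once at the start and end, so each multiplication inside the transform costs a single `redc`.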

For more details, see the paper by Y. Sugizaki and D. Takahashi, "Improved Implementation of Number Theoretic Transform on NVIDIA GPU with Tensor Cores" (doi:10.1145/3773656.3773673), which was accepted to SupercomputingAsia 2026 / International Conference on High Performance Computing in Asia-Pacific Region 2026 (SCA/HPCAsia 2026).

License and contribution

For license and copyright notices, see the SPDX file tags in each file. Unless otherwise noted, files in this project are licensed under the Apache License, Version 2.0 (SPDX short-form identifier: Apache-2.0) and copyrighted by the contributors.

Everyone is encouraged to contribute to this project. See the CONTRIBUTING.md file for instructions.

