Conversation

@xaviliz (Contributor) commented Sep 1, 2025

New feature: OnnxPredict algorithm

Feature

This PR makes additional changes to the Essentia library to build the ONNX Runtime inferencing library from source and implements a new algorithm, OnnxPredict, for running ONNX models (.onnx) with multiple inputs and outputs. A minimal usage sketch is shown after the implementation list below.

Implementation

  • Provide a new build script for the ONNX Runtime inferencing library.
  • Modify the Essentia build scripts to link with the onnxruntime dynamic library.
  • Implement the new algorithm OnnxPredict to run ONNX models in Essentia.
  • Implement unit tests in test_onnxpredict.py.
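
As a rough illustration only (not code from this PR), a standard-mode usage could look like the sketch below. Only "graphFilename" appears in the PR code; the "inputs"/"outputs" parameters, the "poolIn"/"poolOut" connectors, and the node names are assumptions modeled on TensorflowPredict.

#include <essentia/algorithmfactory.h>
#include <essentia/pool.h>
#include <string>
#include <vector>

using namespace essentia;
using namespace essentia::standard;

int main() {
  essentia::init();

  // Hypothetical node names; the real names depend on the ONNX model.
  std::vector<std::string> inputNames = {"melspectrogram"};
  std::vector<std::string> outputNames = {"embeddings"};

  AlgorithmFactory& factory = AlgorithmFactory::instance();
  Algorithm* predict = factory.create("OnnxPredict",
                                      "graphFilename", "discogs-effnet-bsdynamic-1.onnx",
                                      "inputs", inputNames,
                                      "outputs", outputNames);

  Pool poolIn, poolOut;
  // poolIn is expected to contain one tensor per input name before compute().
  predict->input("poolIn").set(poolIn);
  predict->output("poolOut").set(poolOut);
  predict->compute();

  delete predict;
  essentia::shutdown();
  return 0;
}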

Prerequisites

  • python >= 3.10
  • cmake >= 3.28

Testing

  • Builds successfully with ONNX Runtime v1.22.1 on macOS
    • ARM64
    • x86_64
  • Builds successfully with ONNX Runtime v1.22.1 on Linux
  • Multiple input inferencing
  • Multiple output inferencing
  • No runtime errors or compatibility issues

How to Test

Tested with onnxruntime v1.22.1 on:

  • macOS on an ARM64 machine with Python 3.13.4 and CMake 4.0.2
  • Linux (Docker) with Python 3.10.18 and CMake 4.1.0

How to build ONNX Runtime

After installing the Essentia dependencies in a virtual environment, install cmake:

python3 -m pip install cmake
which cmake

Then we can run the build script:

cd packaging/debian_3rdparty
bash build_onnx.sh

How to build OnnxPredict

On macOS:

source .env/bin/activate
python3 waf configure --fft=KISS --include-algos=OnnxPredict,Windowing,Spectrum,MelBands,UnaryOperator,TriangularBands,FFT,Magnitude,NoiseAdder,RealAccumulator,FileOutputProxy,FrameCutter --static-dependencies --pkg-config-path=/packaging/debian_3rdparty/lib/pkgconfig --with-onnx --lightweight= --with-python --pythondir=.env/lib/python3.13/site-packages
python3 waf -v && python3 waf install

On Linux:

python3 waf configure --fft=KISS --include-algos=OnnxPredict,Windowing,Spectrum,MelBands,UnaryOperator,TriangularBands,FFT,Magnitude,NoiseAdder,RealAccumulator,FileOutputProxy,FrameCutter --static-dependencies --with-onnx --lightweight= --with-python --pkg-config-path /usr/share/pkgconfig --std=c++14
python3 waf -v && python3 waf install

How to unittest

# prepare essentia audio repo
git clone https://github.com/MTG/essentia-audio.git test/essentia-audio
rm -rf test/audio && mv test/essentia-audio test/audio

# download effnet.onnx model for testing
curl https://essentia.upf.edu/models/feature-extractors/discogs-effnet/discogs-effnet-bsdynamic-1.onnx --output test/models/discogs-effnet-bsdynamic-1.onnx
python3 test/src/unittests/all_tests.py onnxpredict

@palonso (Contributor) left a comment:

Great work @xaviliz !!
I left some comments; some are questions about things I didn't understand.

OS=$(uname -s)
CONFIG=Release

if [ "$OS" = "Darwin" ]; then
@palonso (Contributor):

@xaviliz, since we are inside debian_3rdparty, should we remove or move somewhere else the MacOS support?

@xaviliz (Contributor, Author):

Yes, that's true. I kept it for testing purposes. Let me clean it up a bit.

@xaviliz (Contributor, Author):

It has been tested on Linux.

const char* OnnxPredict::name = "OnnxPredict";
const char* OnnxPredict::category = "Machine Learning";

const char* OnnxPredict::description = DOC("This algorithm runs a Onnx graph and stores the desired output tensors in a pool.\n"
@palonso (Contributor):

an ONNX graph?

@xaviliz (Contributor, Author):

It should be an ONNX model; there is no access to graphs in onnxruntime. This is fixed now.


// Do not do anything if we did not get a non-empty model name.
if (_graphFilename.empty()) return;
cout << "after return" << endl;
@palonso (Contributor):

Clean debug output

_env = Ort::Env(ORT_LOGGING_LEVEL_WARNING, "multi_io_inference"); // {"default", "test", "multi_io_inference"}

// Set graph optimization level - check https://onnxruntime.ai/docs/performance/model-optimizations/graph-optimizations.html
_sessionOptions.SetGraphOptimizationLevel(GraphOptimizationLevel::ORT_ENABLE_EXTENDED);
@palonso (Contributor):

Since there are different optimization options, I'm wondering if there is a chance that extended optimization doesn't work or affects model performance in some cases. I think this should be turned into a parameter that defaults to extended.

https://onnxruntime.ai/docs/performance/model-optimizations/graph-optimizations.html#graph-optimization-levels

@xaviliz (Contributor, Author):

That's a good point; I am not sure how the optimizations could affect performance. Adding a new parameter sounds good to me. So, do you propose adding a boolean parameter for each optimization, or just a string to select one of them?

@xaviliz (Contributor, Author):

A new optimizationLevel parameter has been added as a string with choices {disable_all, basic, extended, all}, defaulting to extended. Maybe it would be nice to add some additional tests; what do you think about comparing outputs for the identity model across all optimization levels? (A sketch of the string-to-enum mapping follows below.)
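
For reference, a minimal sketch (not necessarily the PR's exact code) of mapping such a string parameter onto ONNX Runtime's GraphOptimizationLevel enum; the helper name parseOptimizationLevel is made up:

#include <onnxruntime_cxx_api.h>
#include <map>
#include <stdexcept>
#include <string>

// Map the optimizationLevel parameter choices onto ORT's enum values.
static GraphOptimizationLevel parseOptimizationLevel(const std::string& level) {
  static const std::map<std::string, GraphOptimizationLevel> levels = {
    {"disable_all", ORT_DISABLE_ALL},
    {"basic",       ORT_ENABLE_BASIC},
    {"extended",    ORT_ENABLE_EXTENDED},
    {"all",         ORT_ENABLE_ALL},
  };
  auto it = levels.find(level);
  if (it == levels.end())
    throw std::invalid_argument("OnnxPredict: invalid optimizationLevel '" + level + "'");
  return it->second;
}

// Usage before creating the session:
//   _sessionOptions.SetGraphOptimizationLevel(parseOptimizationLevel("extended"));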

// Set graph optimization level - check https://onnxruntime.ai/docs/performance/model-optimizations/graph-optimizations.html
_sessionOptions.SetGraphOptimizationLevel(GraphOptimizationLevel::ORT_ENABLE_EXTENDED);
// To enable model serialization after graph optimization set this
_sessionOptions.SetOptimizedModelFilePath("optimized_file_path");
@palonso (Contributor):

I think this is mainly intended for debugging purposes. Can we skip saving the optimized graph for efficiency?

https://onnxruntime.ai/docs/api/c/struct_ort_api.html#ad238e424200c0f1682947a1f342c39ca

@xaviliz (Contributor, Author):

Yes, we don't need to store the optimized graph in a model file.

return out;
}

void OnnxPredict::reset() {
@palonso (Contributor):

Shouldn't we reset the session and env too?

@xaviliz (Contributor, Author):

That's a good point. I couldn't find a reset method for the session and env in the C++ API like in TensorFlow, but let me try it using std::unique_ptr; maybe that could work. However, I am unsure whether we should do that after compute(), because if we reset the session at the end of configure(), session.Run() will fail. (A sketch of the unique_ptr approach is below.)
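
For illustration, a sketch of the std::unique_ptr idea with assumed member names (not the PR's actual fields): the session is (re)built in configure() and destroyed in reset(), so Run() stays valid between configure() and compute().

#include <onnxruntime_cxx_api.h>
#include <memory>
#include <string>

struct OnnxSessionHolder {
  Ort::Env env{ORT_LOGGING_LEVEL_WARNING, "multi_io_inference"};
  Ort::SessionOptions options;
  std::unique_ptr<Ort::Session> session;

  void configure(const std::string& modelPath) {
    // (Re)create the session; a previously held session is destroyed first.
    session = std::make_unique<Ort::Session>(env, modelPath.c_str(), options);
  }

  void reset() {
    // Drop the session; configure() must run again before the next Run() call.
    session.reset();
  }
};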

const Pool& poolIn = _poolIn.get();
Pool& poolOut = _poolOut.get();

std::vector<std::vector<float>> input_datas; // <-- keeps inputs alive
@palonso (Contributor):

input_datas -> input_data?
I think data is already plural

// Step 2: Convert data to float32
input_datas.emplace_back(inputData.size());
for (size_t j = 0; j < inputData.size(); ++j) {
input_datas.back()[j] = static_cast<float>(inputData.data()[j]);
@palonso (Contributor):

Instead of force-casting the data to float, shouldn't we keep it in Real format (which is actually float32 by default) and make sure that the model runs with whatever type Real points to?

@xaviliz (Contributor, Author):

That's true when Real == float, but we need to fall back to casting when Real == double. So we should not try to make ONNX Runtime run on "whatever Real points to"; however, the redundant cast when Real == float could be avoided.

@xaviliz (Contributor, Author) commented Dec 23, 2025:

Fixed #1488! Essentia::Real is already float32 by default, so no need to cast ;)
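
A C++14-friendly sketch (the Linux build above uses --std=c++14) of avoiding the redundant copy: one overload passes a float buffer through unchanged, the other converts only when Real is double. asFloatBuffer is a hypothetical helper, not the PR's code.

#include <vector>

// Real == float: no conversion, reuse the existing buffer.
inline const float* asFloatBuffer(const std::vector<float>& input, std::vector<float>&) {
  return input.data();
}

// Real == double: fall back to a one-time conversion into the scratch buffer.
inline const float* asFloatBuffer(const std::vector<double>& input, std::vector<float>& scratch) {
  scratch.assign(input.begin(), input.end());
  return scratch.data();
}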

}

// Step 3: Create ONNX Runtime tensor
_memoryInfo = Ort::MemoryInfo::CreateCpu(OrtArenaAllocator, OrtMemTypeDefault);
@palonso (Contributor):

Would it be possible to run the models on GPU if available?

@xaviliz (Contributor, Author):

Yes, it is (6131728). Maybe it would be nice to add some tests for these functionalities; I couldn't test them properly yet. (A sketch of enabling the CUDA execution provider when available follows below.)
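
As a sketch of enabling GPU execution only when it is compiled into onnxruntime (maybeEnableCuda is a made-up helper; the calls shown are the standard ORT C++ API, not the PR's exact code):

#include <onnxruntime_cxx_api.h>
#include <algorithm>
#include <string>
#include <vector>

// Append the CUDA execution provider only when onnxruntime was built with it;
// otherwise the session keeps the default CPU execution provider.
inline void maybeEnableCuda(Ort::SessionOptions& options, int deviceId) {
  std::vector<std::string> providers = Ort::GetAvailableProviders();
  bool hasCuda = std::find(providers.begin(), providers.end(),
                           "CUDAExecutionProvider") != providers.end();
  if (hasCuda) {
    OrtCUDAProviderOptions cudaOptions{};
    cudaOptions.device_id = deviceId;
    options.AppendExecutionProvider_CUDA(cudaOptions);
  }
}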

def _create_essentia_class(name, moduleName = __name__):
essentia.log.debug(essentia.EPython, 'Creating essentia.standard class: %s' % name)

# print(f"name: {name}")
@palonso (Contributor):

remove debug print

…hen cuda, metal or open_ml are not compiled in onnxruntime library
… level in ORT.

- Declared as a new string parameter with choices: {disable_all,basic,extended,all}, by default “extended”.
	- https://onnxruntime.ai/docs/performance/model-optimizations/graph-optimizations.html#levels
- Set graph optimization level in ORT Session.
- `test_default_optimization_level()`: Check that the default optimization level is 'extended'.
- `test_set_valid_optimization_levels()`: Check that valid optimization levels can be set without errors.
- `test_set_invalid_optimization_level()`: Check that invalid optimization levels raise an error.
- CUDA tensors now use Ort::MemoryInfo::CreateCuda to allocate GPU memory.
- Metal and CoreML providers continue to use CPU tensors; data is managed internally by the provider.
- Execution provider is auto-selected based on availability and _deviceId.
@xaviliz (Contributor, Author) commented Dec 23, 2025

Thank you @palonso, all the changes/suggestions have been addressed.
Please review the changes and give me feedback when you can.
I think it is almost done.

Before merging it would be nice to:

  • Test the algorithm on Linux with the latest changes (it should work fine).
  • Test it on OSX with METAL and OPEN_ML.
  • Make a brew cask to test the Essentia installation with the onnxruntime package.
    • I think it was failing because it was built without some static-dependencies flag.
