Open
Description
Answers checklist.
- I have read the ESP-DSP documentation and the issue is not addressed there.
- I have read the documentation ESP-IDF Programming Guide and the issue is not addressed there.
- I have updated my ESP-DSP branch (master or release) to the latest version and checked that the issue is present there.
- I have searched the issue tracker for a similar issue and not found a similar issue.
General issue report
According to this page https://docs.espressif.com/projects/esp-dsp/en/latest/esp32/esp-dsp-benchmarks.html, dsps_dotprod_s16_ae32() for len=256 completes in 447 cpu cycles. That is - I guess - 1 cycle/mac + overhead.
However when I try to measure it using the ccount register, I measure over 950.
First I used xthal_get_ccount() and redid the measurement using inline asm (asm volatile("rsr %0,ccount":"=a" (ccount));) that yielded ~the same.
In my setup, the core that runs the test is empty besides this task. I run 16000 measurements and then evaluate the results.
What am I missing? How can I reproduce the stock results?
Thanks,
Viktor