Description
As noted in https://www.phoronix.com/scan.php?page=article&item=clang12-gcc11-icelake&num=2, compression speed in long mode differs hugely between Clang and GCC builds.
Standard mode looks roughly comparable between the two compilers.
Could this be a missed vectorization? Is there any chance to improve it?