In a line like this one, https://github.com/GridTools/cpp_bindgen/blob/d0b4b216f141da90f8534356cccbb59423667e8f/include/cpp_bindgen/common/any_moveable.hpp#L61, it turns out the overhead can be 20x that of a regular function call. Could this be improved for low-latency applications?