You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
oneDNN enables 4bits for matmul, like u4/s4 and float-4bit. However, there is no document or example about 4-bit data type. Especially, 4bit storage structure and use cases in the real inference.
Could you provide the more details to describe it?