When optimizer is enabled, gpt-oss-20b encounters the following error from reshape op:
Always | TT_FATAL: Can only work with bfloat16/float32 or int32/uint32 tensors, but got DataType::UINT16
I edited the error message to display which dtype it's running into. This error does not happen when optimizer is disabled.
gpt_oss_20b_tp_1x8_with_optimizer.log