I think some the models I have tested do use resoning but I cannot see the reasoning token consumption in the ocmonitor models breakdown:
I know that the token count for reasoning may be different for each model provider (some return the implicitly as "reasoning tokens", other just add them the output tokens).
Since reasoning tokens are charged like output tokens normally it would be helpful so see the reasoning token count too in the output.
Also: Additionally to the "Total Tokens" col I think it could make sense to also show the cached reads and write to make the "cost" col more understandable.
I think some the models I have tested do use resoning but I cannot see the reasoning token consumption in the
ocmonitor modelsbreakdown:I know that the token count for reasoning may be different for each model provider (some return the implicitly as "reasoning tokens", other just add them the output tokens).
Since reasoning tokens are charged like output tokens normally it would be helpful so see the reasoning token count too in the output.
Also: Additionally to the "Total Tokens" col I think it could make sense to also show the cached reads and write to make the "cost" col more understandable.