https://openai.com/api/pricing/#:~:text=Our%20APIs-,Realtime%20API,-Build%20low%2Dlatency

The problem with the Realtime API models is that they have separate cached-input prices for text, audio, and image tokens. Image tokens are also not in the spec yet.
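To make the gap concrete, here is a rough sketch of what per-modality pricing with its own cached rate could look like. Everything in it is an assumption for illustration: the `ModalityPrice` class, the field names, the `input_cost` helper, and the numbers (which are placeholders, not the real prices from the page) are all hypothetical and not taken from the spec.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModalityPrice:
    """Hypothetical price entry: USD per 1M input tokens for one modality."""
    input_per_mtok: float
    cached_input_per_mtok: float


def input_cost(prices, usage):
    """Sum input cost over modalities.

    `usage` maps a modality name to (uncached_tokens, cached_tokens).
    Raises KeyError if a modality (e.g. "image") has no price entry.
    """
    total = 0.0
    for modality, (uncached, cached) in usage.items():
        price = prices[modality]
        total += uncached * price.input_per_mtok / 1_000_000
        total += cached * price.cached_input_per_mtok / 1_000_000
    return total


# Placeholder numbers only; NOT the actual Realtime API prices.
PRICES = {
    "text": ModalityPrice(input_per_mtok=4.0, cached_input_per_mtok=0.5),
    "audio": ModalityPrice(input_per_mtok=32.0, cached_input_per_mtok=0.5),
    "image": ModalityPrice(input_per_mtok=5.0, cached_input_per_mtok=0.5),
}

usage = {"text": (1_000_000, 2_000_000), "audio": (500_000, 0)}
cost = input_cost(PRICES, usage)
```

A single shared cached-input price cannot represent this, since each modality needs its own pair of rates, and an `image` entry has to exist before image tokens can be priced at all.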