-
Notifications
You must be signed in to change notification settings - Fork 24
Description
Description
Hi, I've been looking for a method to compute emissions linked to cached tokens to apply it to my use case (LLM usage for software development throuhg Cursor/Claude & large context windows) and recently attempted to add KV-cache emissions to the calculation engine in this PR : #200.
The update is according to this paper detailing KV-cache impact on GPU power consumption + model latency. As I'm not an expert in this matter, I'd like your input/advice on this methodology and if it would be possible to add it to Ecologits' main calculation engine.
This feature seems important as cached tokens increasingly represent a large part of developers' token usage (based on monitoring developers' Cursor usage in my team) and I haven't found an accurate way of using this token count thourgh Ecologits yet.
If something is missing, how can I help to efficiently contribute to this feature ?
Brainstorming possible future features :
- Upload Cursor/other tool personal usage log to automatically calculate developers total emissions
- In Ecologits online calculator, provide the option of selecting context window scenarios such as "scan entire codebase/max model context window usage", "one-off agent edit", "25 successive queries to a single agent", ...