Add KV-cache contribution to total emissions #201

### Description

Hi, I've been looking for a way to compute the emissions linked to cached tokens for my use case (LLM usage for software development through Cursor/Claude with large context windows). I recently attempted to add KV-cache emissions to the calculation engine in this PR: [https://github.com/mlco2/ecologits/pull/200](https://github.com/mlco2/ecologits/pull/200).

The update follows [this article](https://kipp.ly/transformer-inference-arithmetic/), which details the KV-cache's impact on GPU power consumption and model latency. As I'm not an expert in this area, I'd like your input on this methodology and on whether it could be added to Ecologits' main calculation engine.
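To make the quantity under discussion concrete, here is a minimal sketch of the per-token KV-cache size from the linked article (one key and one value vector per layer, each of size `n_kv_heads * d_head`). The function names and the `n_kv_heads` parameter are my own illustrative choices, not part of Ecologits or of the PR, and turning these bytes into energy or emissions would need further assumptions that are not shown here.

```python
def kv_cache_bytes_per_token(
    n_layers: int,
    n_kv_heads: int,
    d_head: int,
    bytes_per_value: int = 2,  # assumption: 16-bit (fp16/bf16) cache entries
) -> int:
    """Bytes of KV cache stored for a single token.

    The leading factor of 2 accounts for storing both a key and a value
    vector in every layer.
    """
    return 2 * n_layers * n_kv_heads * d_head * bytes_per_value


def kv_cache_bytes(n_cached_tokens: int, bytes_per_token: int) -> int:
    """Total KV-cache footprint for a given number of cached tokens."""
    return n_cached_tokens * bytes_per_token


# Illustrative model shape (64 layers, 64 heads of dimension 128),
# not tied to any specific provider's model.
per_token = kv_cache_bytes_per_token(n_layers=64, n_kv_heads=64, d_head=128)
total = kv_cache_bytes(n_cached_tokens=1000, bytes_per_token=per_token)
print(per_token, total)
```

For models using grouped-query attention, `n_kv_heads` would be smaller than the number of query heads, which shrinks the cache accordingly.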

This feature seems important because cached tokens increasingly represent a large share of developers' token usage (based on monitoring my team's Cursor usage), and I haven't yet found an accurate way to account for this token count through Ecologits.

If something is missing, how can I best help contribute to this feature?

Brainstorming possible future features:
- Upload a personal usage log from Cursor or another tool to automatically calculate a developer's total emissions
- In the Ecologits online calculator, provide the option of selecting context window scenarios such as "scan entire codebase/max model context window usage", "one-off agent edit", "25 successive queries to a single agent", ...
