Proposal
Based on https://opensourcemechanistic.slack.com/archives/C07EHMK3XC7/p1762007126888369, the user is getting an error because their context_size is much larger than the total number of tokens they're requesting to cache. We should detect this case up front and raise a helpful error message that names both values.
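
A minimal sketch of what such a check could look like. The function name `validate_cache_request` and the parameter name `total_tokens` are hypothetical and used only for illustration; the real config fields and the place where the check belongs may differ.

```python
def validate_cache_request(context_size: int, total_tokens: int) -> None:
    """Fail early, with a descriptive message, if not even one context can be filled.

    Hypothetical helper: `total_tokens` stands in for whatever setting
    controls how many tokens the user asked to cache.
    """
    if context_size <= 0:
        raise ValueError(f"context_size must be positive, got {context_size}.")
    if total_tokens < context_size:
        raise ValueError(
            f"Requested {total_tokens} tokens to cache, but context_size is "
            f"{context_size}. At least one full context of {context_size} tokens "
            "is needed. Either lower context_size or request more tokens."
        )


# Example: a valid request passes silently, an undersized one raises.
validate_cache_request(context_size=128, total_tokens=1024)
try:
    validate_cache_request(context_size=1024, total_tokens=128)
except ValueError as err:
    print(err)
```

Running the check before any caching work starts means the user sees both numbers and a suggested fix, instead of a failure further down the pipeline.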