
Add @gr.cache() decorator for caching deterministic functions, as well as a lower-level gr.Cache that uses dependency injection #13176

Merged

abidlabs merged 72 commits into main from cachee on Apr 8, 2026

Conversation

@abidlabs
Member

@abidlabs abidlabs commented Apr 1, 2026

Adds @gr.cache() decorator for caching all of the kinds of functions we support in Gradio: regular functions, generators, streaming media, async functions, and async generators. Internally, we now use this same logic when we cache examples, to reduce the duplication a bit. Usage:

@gr.cache()
def generate(prompt):
    ...

See demo/cache_demo/run.py for usage.
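The PR only shows the decorator's surface; as a rough analogy, the core behavior for regular functions is argument-keyed memoization. The sketch below is an illustrative stand-in, not Gradio's implementation (which also handles generators, streaming media, and async functions), and `memoize`, `calls`, and `generate` here are hypothetical names:

```python
import functools

def memoize(fn):
    """Illustrative stand-in for @gr.cache(): cache return values keyed
    by the call's arguments. Gradio's real decorator also covers
    generators, streaming media, and async functions; this sketch
    only shows the core idea for a plain function."""
    store = {}

    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        key = (args, tuple(sorted(kwargs.items())))
        if key not in store:
            store[key] = fn(*args, **kwargs)  # only runs on a cache miss
        return store[key]
    return wrapper

calls = []

@memoize
def generate(prompt):
    calls.append(prompt)  # track how many times the body actually runs
    return f"output for {prompt}"

generate("hello")
generate("hello")  # second call is served from the cache; the body does not re-run
```

With this semantics, repeated calls with identical arguments skip the expensive body entirely, which is what makes the decorator suitable only for deterministic functions.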

I also added a UI indicator when caching is used. I'd love to get thoughts on this, but the idea is to use it as an opportunity to increase the visibility of caching, and also to be transparent with users when a result is fetched from the cache instead of generated from scratch. The indicator shows for a second (in the same place as the "minimal" status tracker, to avoid obscuring the output) and then fades out.

[screenshot of the cache indicator]

We also expose a lower-level cache API based on dependency injection:

def generate(prompt, max_new_tokens, c=gr.Cache()):
    c.set(key, value)
    c.get(key)
    ...

The reasons to use gr.Cache (instead of, say, a global dictionary) are that it supports concurrent usage, supports common eviction policies, and allows keys to be non-hashable data types commonly used by Gradio users. See demo/cache_kv_demo/run.py for usage. Also, if a developer wants to be "safer", they can make a cache per-session instead of global simply by passing gr.Cache(per_session=True).
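The internals of gr.Cache aren't shown in this description; the three advertised properties (thread-safe access, eviction, non-hashable keys) can be sketched in plain Python as below. `MiniCache` is a hypothetical illustration, and normalizing keys via JSON serialization is just one possible way to admit lists and dicts as keys:

```python
import json
import threading
from collections import OrderedDict

class MiniCache:
    """Illustrative sketch of the properties described for gr.Cache:
    thread-safe set/get, LRU eviction, and non-hashable keys
    (normalized here by JSON-serializing them). Not Gradio's
    actual implementation."""

    def __init__(self, max_items=128):
        self._data = OrderedDict()
        self._lock = threading.Lock()
        self._max_items = max_items

    def _normalize(self, key):
        # Lists and dicts aren't hashable; a stable serialization
        # turns them into usable dictionary keys.
        return json.dumps(key, sort_keys=True, default=str)

    def set(self, key, value):
        with self._lock:
            k = self._normalize(key)
            self._data[k] = value
            self._data.move_to_end(k)
            if len(self._data) > self._max_items:
                self._data.popitem(last=False)  # evict least recently used

    def get(self, key, default=None):
        with self._lock:
            k = self._normalize(key)
            if k in self._data:
                self._data.move_to_end(k)  # mark as recently used
                return self._data[k]
            return default

c = MiniCache(max_items=2)
c.set(["a", {"temp": 0.7}], "first")  # non-hashable list key works
c.set("b", "second")
c.set("c", "third")  # exceeds max_items, so the oldest entry is evicted
```

A per-session variant would simply key the whole store by a session identifier as well, which is presumably what gr.Cache(per_session=True) arranges under the hood.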

One other nice thing is that we can keep track of whether there is a cache hit, and, if there is, still show the cache message, which is just slightly different ("used cache"). Open to feedback on the UI!

[screenshot of the "used cache" message]

Also adds tests, 3 demos, and a guide to demonstrate usage.

@gradio-pr-bot
Collaborator

gradio-pr-bot commented Apr 1, 2026

🪼 branch checks and previews

| Name | Status | URL |
| --- | --- | --- |
| Spaces | ready! | Spaces preview |
| Website | building... | |
| Storybook | ready! | Storybook preview |
| 🦄 Changes | failed! | Workflow log |

Install Gradio from this PR

pip install https://huggingface.co/buckets/gradio/pypi-previews/resolve/a8606d6622f8f4c8cec3c8b752443f5e9c5b8358/gradio-6.11.0-py3-none-any.whl

Install Gradio Python Client from this PR

pip install "gradio-client @ git+https://github.com/gradio-app/gradio@a8606d6622f8f4c8cec3c8b752443f5e9c5b8358#subdirectory=client/python"

Install Gradio JS Client from this PR

npm install https://gradio-npm-previews.s3.amazonaws.com/a8606d6622f8f4c8cec3c8b752443f5e9c5b8358/gradio-client-2.1.0.tgz

@gradio-pr-bot
Collaborator

gradio-pr-bot commented Apr 1, 2026

🦄 change detected

This Pull Request includes changes to the following packages.

| Package | Version |
| --- | --- |
| @gradio/client | minor |
| @gradio/markdown-code | minor |
| @gradio/statustracker | minor |
| gradio | minor |

  • Add @gr.cache() decorator for caching deterministic functions, as well as a lower-level gr.Cache that uses dependency injection

‼️ Changeset not approved. Ensure the version bump is appropriate for all packages before approving.

  • Maintainers can approve the changeset by checking this checkbox.

Something isn't right?

  • Maintainers can change the version label to modify the version bump.
  • If the bot has failed to detect any changes, or if this pull request needs to update multiple packages to different versions or requires a more comprehensive changelog entry, maintainers can update the changelog file directly.

@abidlabs changed the title from "Add (partial) caching with support for storing intermdiate caching to Gradio" to "Add (partial) caching with support for storing intermediate caching state" on Apr 1, 2026
@abidlabs changed the title to "Add @gr.cache() decorator for caching deterministic functions, generators, etc." on Apr 2, 2026
@abidlabs changed the title to "Add @gr.cache() decorator for caching deterministic functions, as well as a lower-level cache API based on dependency injection" on Apr 2, 2026
abidlabs and others added 2 commits April 6, 2026 10:50
Co-authored-by: hysts <hysts@users.noreply.github.com>
@abidlabs
Member Author

abidlabs commented Apr 6, 2026

Thanks so much @hysts @freddyaboulton for the great review! Will investigate and fix these issues

@abidlabs
Member Author

abidlabs commented Apr 6, 2026

> One thing I noticed is that cache hit durations seem to be included in the average execution time calculation. So if the first (uncached) call takes 100s, after a few cache hits the displayed average drops to ~50s, ~33s, etc., making it hard to tell what the actual computation cost is. It might be better to exclude cache hits from the average so the "saved time" comparison stays meaningful.

Nice catch, should be fixed now @hysts!
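The fix described in the quoted feedback amounts to filtering cache hits out of the duration average. The actual Gradio code isn't shown here; this is a minimal sketch with hypothetical sample data, where each sample pairs a duration with a cache-hit flag:

```python
def average_duration(samples):
    """Average only the real (uncached) runs: including near-zero
    cache-hit durations would drag the displayed average toward zero,
    as described in the quoted feedback. Each sample is a
    (duration_seconds, was_cache_hit) pair."""
    real = [duration for duration, was_hit in samples if not was_hit]
    return sum(real) / len(real) if real else 0.0

# One real 100 s run followed by two near-instant cache hits:
# a naive mean over all three samples would report roughly 33 s,
# while excluding hits keeps the true computation cost visible.
samples = [(100.0, False), (0.01, True), (0.01, True)]
```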

@abidlabs
Member Author

abidlabs commented Apr 7, 2026

Everything should be fixed now @freddyaboulton @hysts if you can kindly give it another pass!

@abidlabs changed the title to "Add @gr.cache() decorator for caching deterministic functions, as well as a lower-level gr.Cache that uses dependency injection" on Apr 7, 2026
Collaborator

@hysts hysts left a comment


Thanks for the update! LGTM!

Collaborator

@freddyaboulton freddyaboulton left a comment


Thanks for making the changes @abidlabs! Left some comments; good to merge once those are addressed. I think right now the chatbot won't show whether there's been a cache hit, because we manually suppress the status tracker, but it would be good to show the cache hit, since chatbot streaming behaves quite differently depending on whether there is one. Not blocking, though.

@abidlabs
Member Author

abidlabs commented Apr 8, 2026

Thanks so much for the careful review @freddyaboulton and @hysts! Will merge this in once CI passes.

@abidlabs abidlabs merged commit 45c4ecd into main Apr 8, 2026
1 check passed
@abidlabs abidlabs deleted the cachee branch April 8, 2026 18:41