feat: AI auto-tagging #1232

emanuelebeffa · 2025-11-27T22:32:11Z

Adds AI-powered auto-tagging functionality using OpenAI compatible APIs. Users can configure their API key, model, base URL and define a vocabulary of allowed tags, then use the "Refresh AI tags" action to automatically generate tag suggestions for their bookmarks based on content (URL, title, description).

The feature includes:

API key validation
Base URL support to use any OpenAI compatible APIs (e.g. Ollama)
Configurable tag vocabulary to constrain AI suggestions
Bulk tagging
Unit tests

Estimated cost analysis

Using OpenAI's gpt-5-nano model at $0.05 per 1M input tokens and $0.4 per 1M output tokens:

Average cost per bookmark: ~$0.000043

Input: ~250 tokens
Cost: $0,0000125
Output: ~100 tokens
Cost: $0.00004

Volume	Cost
100 bookmarks	~$0.0043
1,000 bookmarks	~$0.043

Possible future evolution

Batch processing optimizations for large bulk operations, further reducing costs
Prompt customization
List supported models

oliexe · 2025-12-02T19:57:23Z

This is awesome. Lgtm, hopefully this will get merged soon.

7jrxt42BxFZo4iAnN4CX · 2025-12-08T17:08:37Z

@sissbruecker we are waiting for this

Eragos · 2025-12-08T17:53:41Z

@7jrxt42BxFZo4iAnN4CX sadly this is only a single OpenAI, not a self-hosted Ollama or other AI-APIs solution… The difference is not so bad but should be carefully thought of – my2ct

7jrxt42BxFZo4iAnN4CX · 2025-12-08T18:10:20Z

@Eragos Then expand to any OpenAI compatible ones? I'd like an open router.

emanuelebeffa · 2025-12-08T18:15:02Z

@7jrxt42BxFZo4iAnN4CX sadly this is only a single OpenAI, not a self-hosted Ollama or other AI-APIs solution… The difference is not so bad but should be carefully thought of – my2ct

I’m already working on Ollama integration, I was just waiting for this PR to be merged first and to check whether there were any blockers

7jrxt42BxFZo4iAnN4CX · 2025-12-08T19:00:04Z

@emanuelebeffa Great news, that would be great.
We are waiting for merge.

emanuelebeffa · 2025-12-09T12:04:04Z

I almost finished Ollama integration, I’ll be reopening the PR soon

7jrxt42BxFZo4iAnN4CX · 2025-12-10T19:03:45Z

@emanuelebeffa

This review was generated with AI assistance

Review Summary

Thanks for this feature! The implementation is well-structured with good test coverage. I found a few issues worth addressing before merge.

🔴 Critical

Bulk "Refresh AI tags" doesn't work for bookmarks with existing tags

The documentation states:

"This will replace the existing tags with new AI-generated suggestions"

However, _auto_tag_bookmark_task skips bookmarks that already have tags:

if bookmark.tags.exists():
    logger.info(f"Skipping AI tagging - bookmark {bookmark_id} already has tags")
    return

Suggested fix: Add a force parameter to allow bulk refresh to override existing tags:

def auto_tag_bookmark(user: User, bookmark: Bookmark, force: bool = False):
    # ...
    _auto_tag_bookmark_task(bookmark.id, user.id, force)

@task()
def _auto_tag_bookmark_task(bookmark_id: int, user_id: int, force: bool = False):
    if bookmark.tags.exists() and not force:
        return
    # ...

Then in refresh_ai_tags, pass force=True.

🟠 Important

1. API key exposed in API response

ai_api_key is included in UserProfileSerializer fields without write_only=True. While only the profile owner can access it, this is still risky (XSS, logging, etc.).

Suggested fix:

ai_api_key = serializers.CharField(write_only=True, required=False, allow_blank=True)

2. No rate limiting for bulk AI operations

Selecting many bookmarks for bulk refresh could trigger hundreds of API calls, potentially exceeding provider rate limits or incurring unexpected costs.

Suggestion: Consider adding a limit or at least a warning in the UI.

🟡 Minor / Nice-to-have

Timeout: Consider adding explicit timeout to OpenAI client (default is 10 min, which is quite long)
Vocabulary size limit: Large tag lists increase API costs per request

Overall, great work! The Pydantic structured outputs, error handling with retry logic for 5xx errors, and hallucination filtering are all well done. 👍

emanuelebeffa · 2025-12-11T12:55:47Z

Thanks for the review! I've pushed fixes for Bulk "Refresh AI tags" doesn't work for bookmarks with existing tags, API key exposed in API response and Timeout. I've also pushed UI warnings for No rate limiting for bulk AI operations and Vocabulary size limit.

7jrxt42BxFZo4iAnN4CX · 2025-12-11T14:39:52Z

Looks great! I've reviewed the latest changes, and they successfully address all the previous points.

The implementation looks solid and ready to go.
Waiting for @sissbruecker for the final review and merge. 🚀

emanuelebeffa added 3 commits November 25, 2025 22:59

feat: AI auto tagging

7abbfd4

feat: form error handling

ba81cf6

feat: tests for ai tagger

392986a

emanuelebeffa mentioned this pull request Nov 27, 2025

Add AI Tagging #917

Open

fix: regression in tests

40e28eb

emanuelebeffa closed this Dec 9, 2025

emanuelebeffa added 2 commits December 9, 2025 20:03

feat: support for OpenAI-compatible APIs

1a4d65f

docs: AI auto-tagging

09f6c80

emanuelebeffa reopened this Dec 9, 2025

emanuelebeffa changed the title ~~feat: AI auto tagging~~ feat: AI auto-tagging Dec 9, 2025

emanuelebeffa added 6 commits December 11, 2025 11:00

feat: force AI auto-tagging

47af688

feat: mask AI API key

2a37a2a

feat: bulk AI auto-tagging warning

82c9be2

feat: OpenAI client timeout

b2d19d1

feat: vocabulary size warning

9efe086

fix: regression in tests

673658c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: AI auto-tagging #1232

feat: AI auto-tagging #1232

Uh oh!

emanuelebeffa commented Nov 27, 2025 •

edited

Loading

Uh oh!

oliexe commented Dec 2, 2025

Uh oh!

7jrxt42BxFZo4iAnN4CX commented Dec 8, 2025

Uh oh!

Eragos commented Dec 8, 2025

Uh oh!

7jrxt42BxFZo4iAnN4CX commented Dec 8, 2025

Uh oh!

emanuelebeffa commented Dec 8, 2025

Uh oh!

7jrxt42BxFZo4iAnN4CX commented Dec 8, 2025

Uh oh!

emanuelebeffa commented Dec 9, 2025

Uh oh!

7jrxt42BxFZo4iAnN4CX commented Dec 10, 2025

Uh oh!

emanuelebeffa commented Dec 11, 2025

Uh oh!

7jrxt42BxFZo4iAnN4CX commented Dec 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

feat: AI auto-tagging #1232

Are you sure you want to change the base?

feat: AI auto-tagging #1232

Uh oh!

Conversation

emanuelebeffa commented Nov 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Estimated cost analysis

Possible future evolution

Uh oh!

oliexe commented Dec 2, 2025

Uh oh!

7jrxt42BxFZo4iAnN4CX commented Dec 8, 2025

Uh oh!

Eragos commented Dec 8, 2025

Uh oh!

7jrxt42BxFZo4iAnN4CX commented Dec 8, 2025

Uh oh!

emanuelebeffa commented Dec 8, 2025

Uh oh!

7jrxt42BxFZo4iAnN4CX commented Dec 8, 2025

Uh oh!

emanuelebeffa commented Dec 9, 2025

Uh oh!

7jrxt42BxFZo4iAnN4CX commented Dec 10, 2025

Review Summary

🔴 Critical

🟠 Important

🟡 Minor / Nice-to-have

Uh oh!

emanuelebeffa commented Dec 11, 2025

Uh oh!

7jrxt42BxFZo4iAnN4CX commented Dec 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

emanuelebeffa commented Nov 27, 2025 •

edited

Loading