Ability for PR-Agent to continuously improve using user feedback (learning from accepted suggestions & thumbs-up responses)

### Feature request

Hello Qodo team 👋

We use PR-Agent at our company and have been exploring ways to make the agent *continuously improve* its review quality over time, ideally by learning from the feedback our developers provide.

I’ve reviewed your documentation under:

* [Suggestion Tracking](https://qodo-merge-docs.qodo.ai/tools/improve/#suggestion-tracking)
* [Extra Instructions and Best Practices](https://qodo-merge-docs.qodo.ai/tools/improve/#extra-instructions-and-best-practices)

Based on that, I understand that:

* `allow_thumbs_up_down = true` enables collecting user reactions (👍/👎) purely for **statistics**, and
* `wiki_page_accepted_suggestions = true` can log **accepted suggestions** for manual review and evaluation.

However, **neither feature actually changes or “teaches” the model** over time.

### Motivation

We’d like to know whether there is (or could be) a mechanism for **incremental learning** or **adaptive improvement**, for example:

1. Using accepted suggestions and thumbs-up feedback to gradually refine the model’s responses.
2. Integrating a lightweight retraining loop or fine-tuning pipeline using these logged results.
3. Alternatively, a built-in RAG system that retrieves past “good” PR comments to guide future reviews.

Essentially, the idea is that over time the agent would “learn” our internal standards and communication style — beyond static `extra_instructions` — and progressively adapt.
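To make idea 3 more concrete, here is a rough sketch of what an external retrieval layer might look like. Everything in it is an assumption on my side: the `AcceptedSuggestion` record shape, the function names, and the use of plain token overlap (Jaccard similarity) in place of real embeddings. It is not PR-Agent code and does not call any PR-Agent API.

```python
import re
from dataclasses import dataclass


@dataclass
class AcceptedSuggestion:
    """Hypothetical record for one logged suggestion that developers accepted."""
    summary: str        # short description of the accepted suggestion
    thumbs_up: int = 0  # number of thumbs-up reactions, if collected


def tokenize(text: str) -> set[str]:
    """Lowercase the text and split it into a set of word-like tokens."""
    return set(re.findall(r"[a-z0-9_]+", text.lower()))


def jaccard(a: set[str], b: set[str]) -> float:
    """Token-overlap similarity in [0, 1]; a stand-in for embedding similarity."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def retrieve(history: list[AcceptedSuggestion], diff_text: str, k: int = 3) -> list[AcceptedSuggestion]:
    """Return up to k past suggestions most similar to the current diff."""
    query = tokenize(diff_text)
    scored = [(jaccard(query, tokenize(s.summary)), s.thumbs_up, s) for s in history]
    # Rank by similarity first, then by thumbs-up count as a tie-breaker.
    scored.sort(key=lambda item: (item[0], item[1]), reverse=True)
    return [s for score, _, s in scored if score > 0][:k]


def build_extra_instructions(examples: list[AcceptedSuggestion]) -> str:
    """Format retrieved examples as plain text that could be fed to the reviewer."""
    if not examples:
        return ""
    lines = ["Past suggestions our team accepted; prefer similar feedback:"]
    lines += [f"- {s.summary}" for s in examples]
    return "\n".join(lines)
```

The resulting string could then be supplied through `extra_instructions` on each run, so the guidance changes with the logged feedback instead of staying static. How the history would be exported from the wiki page or from reactions is left open here.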


### Questions

* Is there any ongoing work or roadmap related to self-improvement or adaptive fine-tuning?
* Is there already a mechanism for this (e.g., a plugin, RAG integration, or feedback hook) that I might have missed in the documentation?
* If this direction is not currently planned, could you please share any suggestions or best practices on how a team could implement such a feedback-driven improvement layer externally?

Ability for PR-Agent to continuously improve using user feedback (learning from accepted suggestions & thumbs-up responses) #2075
