-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Open
Labels
Description
Feature request
Hello Qodo team 👋
We are currently using PR-Agent in our company setup, and we’ve been exploring ways to make the agent continuously improve its review quality over time — ideally by learning from feedback provided by our developers.
I’ve reviewed your documentation under:
Based on that, I understand that:
allow_thumbs_up_down = trueenables collecting user reactions (👍/👎) purely for statistics, andwiki_page_accepted_suggestions = truecan log accepted suggestions for manual review and evaluation,
but these features don’t actually change or “teach” the model over time.
Motivation
We’d like to know whether there is (or could be) a mechanism for incremental learning or adaptive improvement — e.g.:
- Using accepted suggestions and thumbs-up feedback to gradually refine the model’s responses.
- Integrating a lightweight retraining loop or fine-tuning pipeline using these logged results.
- (Alternatively) A built-in RAG-based retrieval system that references past “good” PR comments to guide future reviews.
Essentially, the idea is that over time the agent would “learn” our internal standards and communication style — beyond static extra_instructions — and progressively adapt.
Questions
- Is there any ongoing work or roadmap related to self-improvement or adaptive fine-tuning?
- Could there already be a mechanism for this (e.g., through some plugin, RAG integration, or feedback hook) that I might have missed in the documentation?
- If this direction is not currently planned, could you please share any suggestions or best practices on how a team could implement such a feedback-driven improvement layer externally?