Open
Description
Is your feature request related to a problem?
- Yes, it is related to a problem
Describe the feature you'd like
🌟 Feature Description
This request proposes an automated, CI/CD-based duplicate detection and marking system that scans newly created issues and pull requests, identifies potential duplicates, and flags them with labels and bot comments.
🔍 Problem Statement
Currently, contributors often create duplicate issues or pull requests because they are unaware of existing similar discussions. This leads to:
- Redundant work
- Increased maintainer overhead
- Cluttered issue tracker
- Slower triaging process
There is no automated system in place to proactively detect and highlight potential duplicates.
🎯 Expected Outcome
With this feature implemented:
- Duplicate or near-duplicate issues will be automatically detected.
- Suspected duplicates will be labeled as possible-duplicate.
- A bot will comment with links to similar existing issues.
- Maintainers will spend less time manually identifying duplicates.
- The issue tracker will be cleaner and better organized.
📷 Screenshots and Design Ideas
New Issue Created
↓
GitHub Actions Trigger
↓
Fetch Recent Issues
↓
Run Similarity Check (NLP / Keyword Matching)
↓
If duplicate likely:
→ Add label: possible-duplicate
→ Bot comments with related issue links
Else:
→ No action
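To make the flow above concrete, here is a minimal sketch of the keyword-matching variant of the similarity check. It uses only the Python standard library and compares issue titles by Jaccard overlap of their word sets. The function names, the stopword list, the sample issues, and the 0.5 threshold are all hypothetical choices for illustration, not part of any existing bot.

```python
import re

# Hypothetical stopword list; a real deployment would tune this.
STOPWORDS = {"a", "an", "the", "is", "in", "on", "to", "of", "and", "for", "when", "with"}

def tokenize(text):
    """Lowercase the text and return its set of non-stopword word tokens."""
    return {w for w in re.findall(r"[a-z0-9]+", text.lower()) if w not in STOPWORDS}

def jaccard(a, b):
    """Jaccard similarity between two token sets: |A & B| / |A | B|."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)

def find_possible_duplicates(new_title, existing, threshold=0.5):
    """Return (issue_number, score) pairs at or above the threshold, best first."""
    new_tokens = tokenize(new_title)
    scored = [(num, jaccard(new_tokens, tokenize(title))) for num, title in existing.items()]
    hits = [(num, score) for num, score in scored if score >= threshold]
    return sorted(hits, key=lambda pair: pair[1], reverse=True)

# Made-up issues standing in for the "Fetch Recent Issues" step.
existing_issues = {
    101: "App crashes when uploading large files",
    102: "Add dark mode to settings page",
}
matches = find_possible_duplicates(
    "App crashes when uploading large files on Windows", existing_issues
)
# matches holds issue 101 only; issue 102 shares no keywords with the new title.
```

Plain keyword overlap does no stemming, so "crash" and "crashes" count as different words. That limitation is one reason to consider the embedding-based option for near-duplicates that are worded differently.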
📋 Additional Context
- This can be implemented with GitHub Actions and a Python script, or by adopting an existing duplicate-detection bot.
- The similarity check can use NLP-based similarity (embeddings + cosine similarity) or simpler keyword matching.
- The system should be configurable (threshold, labels, auto-close option).
- Similar automation exists in the repositories of large projects such as Kubernetes, TensorFlow, and React.
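The configurable pieces listed above (threshold, label, auto-close) could be grouped as in the sketch below. As a stand-in for real embeddings, it computes cosine similarity over simple term-frequency vectors; an embedding model would replace `term_vector` while leaving the rest unchanged. `DuplicateConfig`, `plan_actions`, the default values, and the sample issues are hypothetical names and data chosen for illustration. The function only returns the planned actions and leaves the actual GitHub API calls to the workflow.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass

@dataclass
class DuplicateConfig:
    """Hypothetical settings; defaults are illustrative, not recommendations."""
    threshold: float = 0.6
    label: str = "possible-duplicate"
    auto_close: bool = False

def term_vector(text):
    """Term-frequency vector of lowercase word tokens (stand-in for an embedding)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine_similarity(u, v):
    """Cosine of the angle between two sparse term-frequency vectors."""
    dot = sum(u[t] * v[t] for t in u.keys() & v.keys())
    norm = math.sqrt(sum(c * c for c in u.values())) * math.sqrt(sum(c * c for c in v.values()))
    return dot / norm if norm else 0.0

def plan_actions(new_text, existing, config):
    """Decide which label, comment, and close action a bot would apply."""
    new_vec = term_vector(new_text)
    similar = sorted(
        num for num, text in existing.items()
        if cosine_similarity(new_vec, term_vector(text)) >= config.threshold
    )
    if not similar:
        return {"labels": [], "comment": None, "close": False}
    links = ", ".join(f"#{n}" for n in similar)
    return {
        "labels": [config.label],
        "comment": f"Possible duplicate of: {links}",
        "close": config.auto_close,
    }

# Made-up issues for demonstration.
existing = {
    7: "login button does not respond on mobile",
    8: "export report as pdf",
}
actions = plan_actions(
    "login button does not respond on mobile safari", existing, DuplicateConfig()
)
# actions["labels"] is ["possible-duplicate"] and the comment links to #7.
```

Keeping `auto_close` off by default means a false positive costs only a label and a comment, which a maintainer can remove.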
Record
- I agree to follow this project's Code of Conduct
- I want to work on implementing this feature