Skip to content

CI/CD: Duplicate Issue Detection and Automatic Marking #3348

@Abhinandankaushik

Description

@Abhinandankaushik

Is your feature request related to a problem? Please describe.

Currently, duplicate issues and pull requests are often created in the repository, leading to redundant discussions, repeated work, and extra maintenance overhead for maintainers. There is no automated mechanism in the CI/CD pipeline to detect potential duplicates and flag them proactively.

Describe the solution (or solutions) you'd like

I propose adding an automated duplicate issue detection mechanism in the CI/CD workflow that:

  • Scans newly created issues and pull requests for similarity with existing ones.
  • Identifies potential duplicates based on title, description, and labels.
  • Automatically marks suspected duplicates with a label such as possible-duplicate.
  • Adds a bot comment suggesting existing related issues with links.
  • Optionally closes the issue if an exact match is found (configurable behavior).

Describe alternatives you've considered and rejected

  • Manual review by maintainers: Rejected because it does not scale and increases workload.
  • GitHub issue templates only: Helpful but does not prevent duplicate submissions.
  • Strict issue submission rules: Too restrictive and may discourage contributions.

Additional context

  • This feature would be especially useful for large repositories with frequent contributions.
  • Can be implemented using GitHub Actions with existing tools like issue similarity bots or custom scripts.
  • Similar functionality is used in repositories like Kubernetes and TensorFlow.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions