Skip to content

CI/CD: Duplicate Issue Detection and Automatic Marking #400

@Abhinandankaushik

Description

@Abhinandankaushik

Is your feature request related to a problem? Please describe.

Currently, duplicate issues and pull requests are often created in the repository, leading to redundant discussions, repeated work, and extra maintenance overhead for maintainers. There is no automated mechanism in the CI/CD pipeline to detect potential duplicates and flag them proactively.

Describe the solution (or solutions) you'd like

I propose adding an automated duplicate issue detection mechanism in the CI/CD workflow that:

  • Scans newly created issues and pull requests for similarity with existing ones.
  • Identifies potential duplicates based on title, description, and labels.
  • Automatically marks suspected duplicates with a label such as possible-duplicate.
  • Adds a bot comment suggesting existing related issues with links.
  • Optionally closes the issue if an exact match is found (configurable behavior).

Describe alternatives you've considered and rejected

  • Manual review by maintainers: Rejected because it does not scale and increases workload.
  • GitHub issue templates only: Helpful but does not prevent duplicate submissions.
  • Strict issue submission rules: Too restrictive and may discourage contributions.

Additional context

  • This feature would be especially useful for large repositories with frequent contributions.
  • Can be implemented using GitHub Actions with existing tools like issue similarity bots or custom scripts.
  • Similar functionality is used in repositories like Kubernetes and TensorFlow.

Metadata

Metadata

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions